GCCGCGTTGG AGTGCTACAA CACGTTCATT GGCGAGAGAA CTGTAGGAGC GCTCCAGGTC 3900

CTAGGTACTG AAGCCCAGTC TTCACTTTTG AAAGCAGTGG CTTTCTTCTT AGAAAGCATT 3960 

GCCATGCATG ACATTATAGC AGCAGAAAAG TGCTTTGGCA CTGGGGCAGC AGGTAACAGA 4020

ACAAGCCCAC AAGAGGGAGA AAGGTACAAC TACAGCAAAT GCACCGTTGT GGTCCGGATT 4080 

ATGGAGTTTA CCACGACTCT GCTAAACACC TCCCCGGT^G GATGGAAGCT CCTGAAGAAG 4140 

GACTTGTGTA ATACACACCT GATGAGAGTC CTGGTGCAGA CGCTGTGTGA GCCCGCAAGC 4200

ATAGGTTTCA ACATCGGAGA CGTCCAGGTT ATGGCTCATC TTCCTGATGT TTGTGTGAAT 4260 

CTGATGAAAG CTCTAAAGAT GTCCCCATAC AAAGATATCC TAGAGACCCA TCTGAGAGAG 4320 

AAAATAACAG CACAGAGCAT TGAGGAGCTT TGTGCCGTCA ACTTGTATGG CCCTGACGCG 4380

CAAGTGGACA GGAGCAGGCT GGCTGCTGTT GTGTCTGCCT GTRAACAGCT TCACAGAGCT 4440 

GGGCTTCTGC ATAATATATT ACCGTCTCAG TCCACAGATT TGCATCATTC TGTTGGCACA 4500 

GAACTTCTTT CCCTGGTTTA TAAAGGCATT GCCCCTGGAG ATGAGAGACA GTGTCTGCCT 4560 

TCTCTAGACC TCAGTTGTAA GCAGCTGGCC AGCGGACTTC TGGAGTTAGC CTTTGCTTTT 4620 

GGAGGACTGT GTGAGCGCCT TGTGAGTCTT CTCCTGAACC CAGCGGTGCT GTCCACGGCG 4680

TCCTTGGGCA GCTCACAGGG CAGCGTCATC CACTTCTCCC ATGGGGAGTA TTTCTATAGC 4740 

TTGTTCTCAG AAACGATCAA CACGGAATTA TTGAAAAATC TGGATCTTGC TGTATTGGAG 4800 

CTCATGCAGT CTTCAGTGGA TAATACCAAA ATGGTGAGTG CCGTTTTGAA CGGCATGTTA 4860 

GACCAGAGCT TCAGGGAGCG AGCAAACCAG AAACACCAAG GACTGAAACT TGCGACTACA 4920 

ATTCTGCAAC ACTGGAAGAA GTGTGATTCA TGGTGGGCCA AAGATTCCCC TCTCGAAACT 4980 

AAAATGGCAG TGCTGGCCTT ACTGGCAAAA ATTTTACAGA TTGATTCATC TGTATCTTTT 5040 

AATACAAGTC ATGGTTCATT CCCTGAAGTC TTTACAACAT ATATTAGTCT ACTTGCTGAC 5100

ACAAAGCTGG ATCTACATTT AAAGGGCCAA GCTGTCACTC TTCTTCCATT CTTCACCAGC 5160

CTCACTGGAG GCAGTCTGGA GGAACTTAGA CGTGTTCTGG AGCAGCTCAT CGTTGCTCAC 5220 

TTCCCCATGC AGTCCAGGGA ATTTCCTCCA GGAACTCCGC GGTTCAATAA TTATGTGGAC 5280

TGCATGAAAA AGTTTCTAGA TGCATTGGAA TTATCTCAAA GCCCTATGTT GTTGGAATTG 5340 

ATGACAGAAG TTCTTTGTCG GGAACAGCAG CATGTCATGG AAGAATTATT TCAATCCAGT 5400 

TTCAGGAGGA TTGCCAGAAG GGGTTCATGT GTCACACAAG TAGGCCTTCT GGAAAGCGTG 5460 

TATGAAATGT TCAGGAAGGA TGACCCCCGC CTAAGTTTCA CACGCCAGTC CTTTGTGGAC 5520

CGCTCCCTCC TCACTCTGCT GTGGCACTGT AGCCTGGATG CTTTGAGAGA ATTCTTCAGC 5580 

ACAATTGTGG TGGATGCCAT TGATGTGTTG AAGTCCAGGT TTACAAAGCT AAATGAATCT 5640

ACCTTTGATA CTCAAATCAC CAAGAAGATG GGCTACTATA AGATTCTAGA CGTGATGTAT 5700 

TCTCGCCTTC CCAAAGATGA TGTTCATGCT AAGGAATCAA AAATTAATCA AGTTTTCCAT 5760 

GGCTCGTGTA TTACAGAAGG AAATGAACTT ACAAAGACAT TGATTAAATT GTGCTACGAT 5820 

GCATTTACAG AGAACATGGC AGGAGAGAAT CAGCTGCTGG AGAGGAGAAG ACTTTACCAT 5880 

TGTGCAGCAT ACAACTGCGC CATATCTGTC ATCTGCTGTG TCTTCAATGA GTTAAAATTT 5940 

TACCAAGGTT TTCTGTTTAG TGAAAAACCA GAAAAGAACT TGCTTATTTT TGAAAATCTG 6000 

ATCGACCTGA AGCGCCGCTA TAATTTTCCT GTAGAAGTTG AGGTTCCTAT GGAAAGAAAG 6060 

AAAAAGTACA TTGAAATTAG GAAAGAAGCC AGAGAAGCAG CAAATGGGGA TTCAGATGGT 6120 

CCTTCCTATA TGTCTTCCCT GTCATATTTG GCAGACAGTA CCCTGAGTGA GGAAATGAGT 6180 

CAATTTGATT TCTCAACCGG AGTTCAGAGC TATTCATACA GCTCCCAAGA CCCTAGACCT 6240 

GCCACTGGTC GTTTTCGGAG ACGGGAGCAG CGGGACCCCA CGGTGCATGA TGATGTGCTG 6300 

GAGCTGGAGA TGGACGAGCT CAATCGGCAT GAGTGCATGG CGCCCCTGAC GGCCCTGGTC 6360 

AAGCACATGC ACAGAAGCCT GGGCCCGCCT CAAGGAGAAG AGGATTCAGT GCCAAGAGAT 6420 

CTTCCTTCTT GGATGAAATT CCTCCATGGC AAACTGGGAA ATCCAATAGT ACCATTAAAT 6480

ATCCGTCTCT TCTTAGCCAA GCTTGTTATT AATACAGAAG AGGTCTTTCG CCCTTACGCG 6540

AAGCACTGGC TTAGCCCCTT GCTGCAGCTG GCTGCTTCTG AAAACAATGG AGGAGAAGGA 6600 

ATTCACTACA TGGTGGTTGA GATAGTGGCC ACTATTCTTT CATGGACAGG CTTGGCCACT 6660 

CCAACAGGGG TCCCTAAAGA TGAAGTGTTA GCAAATCGAT TGCTTAATTT CCTAATGAAA 6720 

CATGTCTTTC ATCCAAAAAG AGCTGTGTTT AGACACAACC TTGAAATTAT AAAGACCCTT 6780 

GTCGAGTGCT GGAAGGATTG TTTATCCATC CCTTATAGGT TAATATTTGA AAAGTTTTCC 6840

GGTAAAGATC CTAATTCTAA AGACAACTCA GTAGGGATTC AATTGCTAGG CATCGTGATG 6900

GCCAATGACC TGCCTCCCTA TGACCCACAG TGTGGCATCC AGAGTAGCGA ATACTTCCAG 6960 

GCTTTGGTGA ATAATATGTC CTTTGTAAGA TATAAAGAAG TGTATGCCGC TGCAGCAGAA 7020 

GTTCTAGGAC TTATACTTCG ATATGTTATG GAGAGAAAAA ACATACTGGA GGAGTCTCTG 7080 

TGTGAACTGG TTGCGAAACA ATTGAAGCAA CATCAGAATA CTATGGAGGA CAAGTTTATT 7140 

GTGTGCTTGA ACAAAGTGAC CAAGAGCTTC CCTCCTCTTG CAGACAGGTT CATGAATGCT 7200 

GTGTTCTTTC TGCTGCCAAA ATTTCATGGA GTGTTGAAAA CACTCTGTCT GGAGGTGGTA 7260 

CTTTGTCGTG TGGAGGGAAT GACAGAGCTG TACTTCCAGT TAAAGAGCAA GGACTTCGTT 7320 

CAAGTCATGA GACATAGAGA TGATGAAAGA CAAAAAGTAT GTTTGGACAT AATTTATAAG 7380 

ATGATGCCAA AGTTAAAACC AGTAGAACTC CGAGAACTTC TGAACCCCGT TGTGGAATTC 7440 

GTTTCCCATC CTTCTACAAC ATGTAGGGAA CAAATGTATA ATATTCTCAT GTGGATTCAT 7500 

GATAATTACA GAGATCCAGA AAGTGAGACA GATAATGACT CCCAGGAAAT ATTTAAGTTG 7560

GCAAAAGATG TGCTGATTCA AGGATTGATC GATGAGAACC CTGGACTTCA ATTAATTATT 7620 

CGAAATTTCT GGAGCCATGA AACTAGGTTA CCTTCAAATA CCTTGGACCG GTTGCTGGCA 7680 

CTAAATTCCT TATATTCTCC TAAGATAGAA GTGCACTTTT TAAGTTTAGC AACAAATTTT 7740 

CTGCTCGAAA TGACCAGCAT GAGCCCAGAT TATCCAAACC CCATGTTCGA GCATCCTCTG 7800 

TCAGAATGCG AATTTCAGGA ATATACCATT GATTCTGATT GGCGTTTCCG AAGTACTGTT 7860 

CTCACTCCGA TGTTTGTGGA GACCCAGGCC TCCCAGGGCA CTCTCCAGAC CCGTACCCAG 7920 

GAAGGGTCCC TCTCAGCTCG CTGGCCAGTG GCAGGGCAGA TAAGGGCCAC CCAGCAGCAG 7980

CATGACTTCA CACTGACACA GACTGCAGAT GGAAGAAGCT CATTTGATTG GCTGACCGGG 8040

AGCAGCACTG ACCCGCTGGT CGACCACACC AGTCCCTCAT CTGACTCCTT GCTGTTTGCC 8100 

CACAAGAGGA GTGAAAGGTT ACAGAGAGCA CCCTTGAAGT CAGTGGGGCC TGATTTTGGG 8160 

AAAAAAAGGC TGGGCCTTCC AGGGGACGAG GTGGATAACA AAGTGAAAGG TGCGGCCGGC 8220 

CGGACGGACC TACTACGACT GCGCAGACGG TTTATGAGGG ACCAGGAGAA GCTCAGTTTG 8280 

ATGTATGCCA GAAAAGGCGT TGCTGAGCAA AAACGAGAGA AGGAAATCAA GAGTGAGTTA 8340 

AAAATGAAGC AGGATGCCCA GGTCGTTCTG TACAGAAGCT ACCGGCACGG AGACCTTCCT 8400

GACATTCAGA TCAAGCACAG CAGCCTCATC ACCCCGTTAC AGGCCGTGGC CCAGAGGGAC 8460 

CCAATAATTG CAAAACAGCT CTTTAGCAGC TTGTTTTCTG GAATTTTGAA AGAGATGGAT 8520 

AAATTTAAGA CACTGTCTGA AAAAAACAAC ATCACTCAAA AGTTGCTTCA AGACTTCAAT 8580 

CGTTTTCTTA ATACCACCTT CTCTTTCTTT CCACCCTTTG TCTCTTGTAT TCAGGACATT 8640 

AGCTGTCAGC ACGCAGCCCT GCTGAGCCTC GACCCAGCGG CTGTTAGCGC TGGTTGCCTG 8700

GCCAGCCTAC AGCAGCCCGT GGGCATCCGC CTGCTAGAGG AGGCTCTGCT CCGCCTGCTG 8760 

CCTGCTGAGC TGCCTGCCAA GCGAGTCCGT GGGAAGGCCC GCCTCCCTCC TGATGTCCTC 8820

AGATGGGTGG AGCTTGCTAA GCTGTATAGA TCAATTGGAG AATACGACGT CCTCCGTGGG 8880

ATTTTTACCA GTGAGATAGG AACAAAGCAA ATCACTCAGA GTGCATTATT AGCAGAAGCC 8940 

AGAAGTGATT ATTCTGAAGC TGCTAAGCAG TATGATGAGG CTCTCAATAA ACAAGACTGG 9000

GTAGATGGTG AGCCCACAGA AGCCGAGAAG GATTTTTGGG AACTTGCATC CCTTGACTGT 9060 
TACAACCACC TTGCTGAGTG GAAATCACTT GAATACTGTT CTACAGCCAG TATAGACAGT 9120

GAGAACCCCC CAGACCTAAA TAAAATCTGG AGTGAACCAT TTTATCAGGA AACATATCTA 9180

CCTTACATGA TCCGCAGCAA GCTGAAGCTG CTGCTCCAGG GAGAGGCTGA CCAGTCCCTG 9240 

CTGACATTTA TTGACAAAGC TATGCACGGG GAGCTCCAGA AGGCGATTCT AGAGCTTCAT 9300 
TACAGTCAAG AGCTGAGTCT GCTTTACCTC CTGCAAGATG ATGTTGACAG AGCCAAATAT 9360

TACATTCAAA ATGGCATTCA GAGTTTTATG CAGAATTATT CTAGTATTGA TGTCCTCTTA 9420 

CACCAAAGTA GACTCACCAA ATTGCAGTCT GTACAGGCTT TAACAGAAAT TCAGGAGTTC 9480 

ATCAGCTTTA TAAGCAAACA AGGCAATTTA TCATCTCAAG TTCCCCTTAA GAGACTTCTG 9540 

AACACCTGGA CAAACAGATA TCCAGATGCT AAAATGGACC CAATGAACAT CTGGGATGAC 9600 
ATCATCACAA ATCGATGTTT CTTTCTCAGC AAAATAGAGG AGAAGCTTAC CCCTCTTCCA 9660

GAAGATAATA GTATGAATGT GGATCAAGAT GGAGACCCCA GTGACAGGAT GGAAGTGCAA 9720

GAGCAGGAAG AAGATATCAG CTCCCTGATC AGGAGTTGCA AGTTTTCCAT GAAAATGAAG 9780 

ATGATAGACA GTGCCCGGAA GCAGAACAAT TTCTCACTTG CTATGAAACT ACTGAAGGAG 9840 

CTGCATAAAG AGTCAAAAAC CAGAGACGAT TGGCTGGTGA GCTGGGTGCA GAGCTACTGC 9900 
CGCCTGAGCC ACTGCCGGAG CCGGTCCCAG GGCTGCTCTG AGCAGGTGCT CACTGTGCTG 9960

AAAACAGTCT CTTTGTTGGA TGAGAACAAC GTGTCAAGCT ACTTAAGCAA AAATATTCTG 10020 

GCTTTCCGTG ACCAGAACAT TCTCTTGGGT ACAACTTACA GGATCATAGC GAATGCTCTC 10080 

AGCAGTGAGC CAGCCTGCCT TGCTGAAATC GAGGAGGACA AGGCTAGAAG AATCTTAGAG 10140 

CTTTCTGGAT CCAGTTCAGA GGATTCAGAG AAGGTGATCG CGGGTCTGTA CCAGAGAGCA 10200 
TTCCAGCACC TCTCTGAGGC TGTGCAGGCG GCTGAGGAGG AGGCCCAGCC TCCCTCCTGG 10260

AGCTGTGGGC CTGCAGCTGG GGTGATTGAT GCTTACATGA CGCTGGCAGA TTTCTGTGAC 10320 

CAACAGCTGC GCAAGGAGGA AGAGAATGCA TCAGTTATTG ATTCTGCAGA ACTGCAGGCG 10380 

TATCCAGCAC TTGTGGTGGA GAAAATGTTG AAAGCTTTAA AATTAAATTC CAATGAAGCC 10440 

AGATTGAAGT TTCCTAGATT ACTTCAGATT ATAGAACGGT ATCCAGAGGA GACTTTGAGC 10500
CTCATGACAA AAGAGATCTC TTCCGTTCCC TGCTGGCAGT TCATCAGCTG GATCAGCCAC 10560

ATGGTGGCCT TACTGGACAA AGACCAAGCC GTTGCTGTTC AGCACTCTGT GGAAGAAATC 10620 

ACTGATAACT ACCCGCAGGC TATTGTTTAT CCCTTCATCA TAAGCAGCGA AAGCTATTCC 10680 

TTCAAGGATA CTTCTACTGG TCATAAGAAT AAGGAGTTTG TGGCAAGGAT TAAAAGTAAG 10740 

TTGGATCAAG GAGGAGTGAT TCAAGATTTT ATTAATGCCT TAGATCAGCT CTCTAATCCT 10800 
GAACTGCTCT TTAAGGATTG GAGCAATGAT GTAAGAGCTG AACTAGCAAA AACCCCTGTA 10860

AATAAAAAAA ACATTGAAAA AATGTATGAA AGAATGTATG CAGCCTTGGG TGACCCAAAG 10920 

GCTCCAGGCC TGGGGGCCTT TAGAAGGAAG TTTATTCAGA CTTTTGGAAA AGAATTTGAT 10980 

AAACATTTTG GGAAAGGAGG TTCTAAACTA CTGAGAATGA AGCTCAGTGA CTTCAACGAC 11040 

ATTACCAACA TGCTACTTTT AAAAATGAAC AAAGACTCAA AGCCCCCTGG GAATCTGAAA 11100 
GAATGTTCAC CCTGGATGAG CGACTTCAAA GTGGAGTTCC TGAGAAATGA GCTGGAGATT 11160

CCCGGTCAGT ATGACGGTAG GGGAAAGCCA TTGCCAGAGT ACCACGTGCG AATCGCCGGG 11220 

TTTGATGAGC GGGTGACAGT CATGGCGTCT CTGCGAAGGC CCAAGCGCAT CATCATCCGT 11280 

GGCCATGACG AGAGGGAACA CCCTTTCCTG GTGAAGGGTG GCGAGGACCT GCGGCAGGAC 11340

CAGCGCGTGG AGCAGCTCTT CCAGGTCATG AATGGGATCC TGGCCCAAGA CTCCGCCTGC 11400
AGCCAGAGGG CCCTGCAGCT GAGGACCTAT AGCGTTGTGC CCATGACCTC CAGGTTAGGA 11460

TTAATTGAGT GGCTTGAAAA TACTGTTACC TTGAAGGACC TTCTTTTGAA CACCATGTCC 11520 

CAAGAGGAGA AGGCGGCTTA CCTGAGTGAT CCCAGGGCAC CGCCGTGTGA ATATAAAGAT 11580 

TGGCTGACAA AAATGTGAGG AAAACATGAT GTTGGAGCTT ACATGCTAAT GTATAAGGGC 11640 

GCTAATCGTA CTGAAACAGT CACGTCTTTT AGAAAACGAG AAAGTAAAGT GCCTGCTGAT 11700 
CTCTTAAAGC GGGCCTTCGT GAGGATGAGT ACAAGCCCTG AGGCTTTCCT GGCGCTCCGC 11760

TCCCACTTCG CCAGCTCTCA CGCTCTGATA TGCATCAGCC ACTGGATCCT CGGGATTGGA 11820 

GACAGACATC TGAACAACTT TATGGTGGCC ATGGAGACTG GCGGCGTGAT CGGGATCGAC 11880 

TTTGGGCATG CGTTTGGATC CGCTACACAG TTTCTGCCAG TCCCTGAGTT GATGCCTTTT 11940 

CGGCTAACTC GCCAGTTTAT CAATCTGATG TTACCAATGA AAGAAACGGG CCTTATGTAC 12000 
AGCATCATGG TACACGCACT CCGGGCCTTC CGCTCAGACC CTGGCCTGCT CACCAACACC 12060

ATGGATGTGT TTGTCAAGGA GCCCTCCTTT GATTGGAAAA ATTTTGAACA GAAAATGCTG 12120 

AAAAAAGGAG GGTCATGGAT TCAAGAAATA AATGTTGCTG AAAAAAATTG GTACCCCCGA 12180

CAGAAAATAT GTTACGCTAA GAGAAAGTTA GCAGGTGCCA ATCCAGCAGT CATTACTTGT 12240 

GATGAGCTAC TCCTGGGTCA TGAGAAGGCC CCTGCCTTCA GAGACTATGT GGCTGTGGCA 12300 
CGAGGAAGCA AAGATCACAA CATTCGTGCC CAAGAACCAG AGAGTGGGCT TTCAGAAGAG 12360

ACTCAAGTGA AGTGCCTGAT GGACCAGGCA ACAGACCCCA ACATCCTTGG CAGAACCTGG 12420 

GAAGGATGGG AGCCCTGGAT GTGAGGTCTG TGGGAGTCTG CAGATAGAAA GCATTACATT 12480 

GTTTAAAGAA TCTACTATAC TTTGGTTGGC AGCATTCCAT GAGCTGATTT TCCTGAAACA 12540 

CTAAAGAGAA ATGTCTTTTG TGCTACAGTT TCGTAGCATG AGTTTA7VATC AAGATTATGA 12600
TGAGTAAATG TGTATGGGTT AAATCAAAGA TAAGGTTATA GTAACATCAA AGATTAGGTG 12660

AGGTTTATAG AAAGATAGAT ATCCAGGCTT ACCAAAGTAT TAAGTCAAGA ATATAATATG 12720

TGATCAGCTT TCAAAGCATT TACAAGTGCT GCAAGTTAGT GAAACAGCTG TCTCCGTAAA 12780

TGGAGGAAAT GTGGGGAAGC CTTGGAATGC CCTTCTGGTT CTGGCACATT GGAAAGCACA 12840 

CTCAGAAGGC TTCATCACCA AGATTTTGGG AGAGTAAAGC TAAGTATAGT TGATGTAACA 12900
TTGTAGAAGC AGCATAGGAA CAATAAGAAC AATAGGTAAA GCTATAATTA TGGCTTATAT 12960

TTAGAAATGA CTGCATTTGA TATTTTAGGA TATTTTTCTA GGTTTTTTCC TTTCATTTTA 13020 

TTCTCTTCTA GTTTTGACAT TTTATGATAG ATTTGCTCTC TAGAAGGAAA CGTCTTTATT 13080 

TAGGAGGGCA AAAATTTTGG TCATAGCATT CACTTTTGCT ATTCCAATCT ACAACTGGAA 13140 

GATACATAAA AGTGCTTTGC ATTGAATTTG GGATAACTTC AAAAATCCCA TGGTTGTTGT 13200 
TAGGGATAGT ACTAAGCATT TCAGTTCCAG GAGAATAAAA GAAATTCCTA TTTGAAATGA 13260

ATTCCTCATT TGGAGGAAAA AAAGCATGCA TTCTAGCACA ACAAGATGAA ATTATGGAAT 13320 

ACAAAAGTGG CTCCTTCCCA TGTGCAGTCC CTGTCCCCCC CCGCCAGTCC TCCACACCCA 13380

AACTGTTTCT GATTGGCTTT TAGCTTTTTG TTGTTTTTTT TTTTCCTTCT AACACTTGTA 13440 

TTTGGAGGCT CTTCTGTGAT TTTGAGAAGT ATACTCTTGA GTGTTTAATA AAGTTTTTTT 13500 
CCAAAAGTA

Seq ID NO: 99 Protein sequence: 
Protein Accession #: NP_00883S.S 

1 11 21 31 41 51

| | | | | |

MAGSGAGVRC SLLRLQETLS AADRCGAALA GHQLIRGLGQ ECVLSSSPAV LALQTSLVFS 60

RDFGLLVFVR KSLNSIEPRE CREEILKFLC IFLEKMGQKI APYSVEIKNT CTSVYTKDRA 120 

AKCKIPALDL LIKLLQTFRS SRLMDEFKIG ELFSKFYGEL ALKKKIPDTV LEKVYELLGL 180

LGEVHPSEMI NNAENLFRAF LGELKTQMTS AVREPKLPVL AGCLKGLSSL LCNFTKSMEE 240

DPQTSREIFN FVLKAIRPQI DLKRYAVPSA GLRLFALHAS QFSTCLLDNY VSLFEVLLKW 300

CAHTNVELKK AALSALESFL KQVSNMVAKN AEMHKNKLQY FMEQFYGIIR NVDSNNKBLS 360 
IAIRGYGLFA GPCKVINAKD VDFMYVBLIQ RCKQMPLTQT DTGDDRVYQM PSFLQSVASV 420

LLYLDTVPEV YTPVLEHLVV MQIDSFPQYS PKMQLVCCRA IVKVFLAIAA KGPVLRNCIS 480

TVVHQGLIRI CSKPVVLPKG PESESBDHRA SGBVRTGKWK VPTYKDYVDL FRHLLSSDQM 540

MDSIIADEAF FSVNSSSESL NHLLYDEFVK SVLKIVEKLD LTLEIQTVGE QENGDEAPGV 600 

WMIPTSDPAA NLHPAKPKDF SAFINLVEFC REILPEKQAE FFEPWVYSFS YELILQSTRL 660

PLISGFYKLL SITVRNAKKI KYFEGVSPKS LKHSPEDPEK YSCFALFVKF GKEVAVKMKQ 720 

YKDELLASCL TFLLSLPHNI IELDVRAYVP ALQMAPKLGL SYTPIABVGL NALEEWSIYI 780

DRHVMQPYYK DILPCLDGYL KTSALSDETK NNWEVSALSR AAQKGFNKW LKHLKKTKNL 840 

SSNEAISLEE IRIRWQMLG SLGGQINKNL LTVTSSDEMM KSYVAWDREK RLSFAVPFRE 900 

MKPVIFLDVF LPRVTELALT ASDRQTKVAA CELLHSMVMF MLGKATQMPE GGQGAPPMYQ 960

LYKRTFPVLL RhACDVDQVT RQI/YEPLVMQ LIHWFTNNKK FESQDTVALL EAILDGIVDP 1020 

VDSTLRDFCG RCIREFLKWS IKQITPQQQE KSPVNTKSLF KRLYSLALHP NAFKRLGASL 1080 

AFNNIYREFR EEESLVEQFV PEALVIYMES LALAHADEKS LGTIQQCCDA IDHLCRIIEK 1140

KHVSLNKAKK RRLPRGFPPS ASLCLLDLVK WLLAHCGRPQ TECRHKSIEL FYKFVPLLPG 1200 

NRSPNLWLKD VLKEEGVSFL INTFEGGGCG QPSGILAQPT LLYLRGPFSL QATLCWLDLL 1260

LAALECYNTF IGERTVGALQ VLGTEAQSSL LKAVAFFLES IAMHDIIAAE KCFGTGAAGN 1320

RTSPQEGERY NYSKCTVVVR IMEFTTTLLN TSPEGWKLLK KDLCNTHLMR VLVQTLCEPA 1380

SIGFNIGDVQ VMAHLPDVCV NLMKALKMSP YKDILETHLR EKITAQSIEE LCAVNLYGPD 1440 

AQVDRSRLAA VVSACKQLHR AGLLHNILPS QSTDLHHSVG TELLSLVYKG IAPGDERQCL 1500

PSLDLSCKQL ASGLLELAFA FGGLCERLVS LLLNPAVLST ASLGSSQGSV IHFSHGEYFY 1560

SLFSETINTE LLKNLDLAVL ELMQSSVDNT KMVSAVLNGM LDQSFRERAN QKHQGLKIAT 1620 

TILQHWKKCD SWWAKDSPLE TKMAVLALLA KILQIDSSVS FNTSHGSFPE VFTTYISLLA 1680 

DTKLDLHLKG QAVTLLPFFT SLTGGSLEEL RRVLEQLIVA HFPMQSREFP PGTPRFNNYV 1740 

DCMKKFLDAL ELSQSPMLLE LMTEVLCREQ QHVMEELFQS SFRRIARRGS CVTQVGLLES 1800

VYEMFRKDDP RLSFTRQSFV DRSLLTLLWH CSLDALREFF STIVVDAIDV LKSRFTKLNE 1860

STFDTQITKK MGYYKILDVM YSRLPKDDVH AKESKINQVF HGSCITEGNE LTKTLIKLCY 1920

DAFTENMAGE NQLLERRRLY HCAAYNCAIS VICCVFNELK FYQGFLFSEK PEKNLLIFEN 1980

LIDLKRRYNF PVEVEVPMER KKKYIEIRKE AREAANGDSD GPSYMSSDSY LADSTLSEEM 2040

SQFDFSTGVQ SYSYSSQDPR PATGRFRRRE QRDPTVHDDV LELEMDELNR HECMAPLTAL 2100 

VKHMHRSLGP PQGEEDSVPR DLPSWMKFLH GKLGNPIVPL NIRLFLAKLV INTEEVFRPY 2160

AKHWLSPLLQ LAASENNGGE GIHYMVVEIV ATILSWTGLA TPTGVPKDEV LANRLLNFLM 2220

KHVFHPKRAV FRHNLEIIKT LVECWKDCLS IPYRLIFEKF SGKDPNSKDN SVGIQLLGIV 2280 

MANDLPPYDP QCGIQSSEYF QALVNNMSFV RYKEVYAAAA EVLGLILRYV MERKNILEES 2340

LCELVAKQLK QHQNTMEDKF IVCLNKVTKS FPPLADRFMN AVFFLLPKFH GVLKTLCLEV 2400

VLCRVEGMTE LYFQLKSKDF VQVMRHRDDE RQKVCLDIIY KMMPKLKPVE LRELLNPVVE 2460

FVSHPSTTCR EQMYNILMWI HDNYRDPESE TDNDSQEIFK LAKDVLIQGL IDENPGLQLI 2520 

IRNFWSHETR LPSNTLDRLD ALKSLYSPKI EVHFLSLATN FLLEMTSMSP DYPNPMFEHP 2580

LSECEFQEYT IDSDWRFRST VLTPMFVETQ ASQGTLQTRT QEGSLSARWP VAGQIRATQQ 2640 

QHDFTLTQTA DGRSSFDWLT GSSTDPLVDH TSPSSDSLLF AHKRSERLQR APLKSVGPDF 2700 

GKKRLGLPGD EVDNKVKGAA GRTDLLRLRR RFMRDQEKLS LMYARKGVAE QKREKEIKSE 2760

LKMKQDAQVV LYRSYRHGDL PDIQIKHSSL ITPLQAVAQR DPIIAKQLFS SLFSGILKEM 2820

DKFKTLSEKN NITQKLLQDF NRFLNTTFSF FPPFVSCIQD ISCQHAALLS LDPAAVSAGC 2880 

LASLQQPVGI RLLEEALLRL LPAELPAKRV RGKARLPPDV LRWVELAKLY RSIGEYDVLR 2940

GIFTSEIGTK QITQSALLAE ARSDYSEAAK QYDEALNKQD WVDGEPTEAE KDFWELASLD 3000 

CYNHLAEWKS LEYCSTASID SENPPDLNKI WSEPFYQETY LPYMIRSKLK LLLQGEADQS 3060

LLTFIDKAMH GELQKAILEL HYSQELSLLY LLQDDVDRAK YYIQNGIQSF MQNYSSIDVL 3120

LHQSRLTKLQ SVQALTEIQE FISFISKQGN LSSQVPLKRL LNTWTNRYPD AKMDPMNIWD 3180 

DIITNRCFFL SKIEEKLTPL PEDNSMNVDQ DGDPSDRMEV QEQEEDISSL IRSCKFSMKM 3240

KMIDSARKQN NFSLAMKLLK ELHKESKTRD DWLVSWVQSY CRLSHCRSRS QGCSEQVLTV 3300

LKTVSLLDEN NVSSYLSKNI LAFRDQNILL GTTYRIIANA LSSEPACLAE IEEDKARRIL 3360

ELSGSSSEDS EKVIAGLYQR AFQHLSEAVQ AAEEEAQPPS WSCGPAAGVI DAYMTLADFC 3420

DQQLRKEEEN ASVIDSAELQ AYPALVVEKM LKALKLNSNE ARLKFPRLLQ IIERYPEETL 3480

SLMTKEISSV PCWQFISWIS HMVALLDKDQ AVAVQHSVEE ITDNYPQAIV YPFIISSESY 3540 

SFKDTSTGHK NKEFVARIKS KLDQGGVIQD FINALDQLSN PELLFKDWSN DVRAELAKTP 3600 

VNKKNIEKMY ERMYAALGDP KAPGLGAFRR KFIQTFGKEF DKHFGKGGSK LLRMKLSDFN 3660

DITNMLLLKM NKDSKPPGNL KECSPWMSDF KVEFLRNELE IPGQYDGRGK PLPEYHVRIA 3720

GFDERVTVMA SLRRPKRIII RGHDEREHPF LVRGGEDLRQ DQRVEQLFQV MNGILAQDSA 3780

CSQRALQLRT YSVVPMTSRL GLIEWLENTV TLKDLLLNTM SQEEKAAYLS DPRAPPCEYK 3840

DWLTKMSGKH DVGAYMLMYK GANRTETVTS FRKRESKVPA DLLKRAFVRM STSPEAFLAL 3900

RSHFASSHAL ICISHWILGI GDRHLNNFMV AMETGGVIGI DFGHAFGSAT QFLPVPELMP 3960

FRLTRQFINL MLPMKETGLM YSIMVHALRA FRSDPGLLTN TMDVFVKEPS FDWKNFEQKM 4020

LKKGGSWIQE INVAEKNWYP RQKICYAKRK LAGANPAVIT CDELLLGHEK APAFRDYVAV 4080
ARGSKDHNIR AQEPESGLSE ETQVKCLMDQ ATDPNILGRT WEGWEPWM

Seq ID NO: 100 DNA sequence

Nucleic Acid Accession #: NM_000673 
Coding sequence: 101-1225 
1 11 21 31 41 51

| | | | | |

ATGTGAAGGC ACAAGCTGCT GTTATATACA ACAGAGTGAA CTGAGCATCA GTCAGAAAAA 60

GTCTATGTTT GCAGAAATAC AGATCCAAGA CAAAGACAGG ATGGGCACTG CTGGAAAAGT 120

TATTAAATGC AAAGCAGCTG TGCTTTGGGA GCAGAAGCAA CCCTTCTCCA TTGAGGAAAT 180

AGAAGTTGCC CCACCAAAGA CTAAAGAAGT TCGCATTAAG ATTTTGGCCA CAGGAATCTG 240

TCGCACAGAT GACCATGTGA TAAAAGGAAC AATGGTGTCC AAGTTTCCAG TGATTGTGGG 300

ACATGAGGCA ACTGGGATTG TAGAGAGCAT TGGAGAAGGA GTGACTACAG TGAAACCAGG 360

TGACAAAGTC ATCCCTCTCT TTCTGCCACA ATGTAGAGAA TGCAATGCTT GTCGCAACCC 420

AGATGGCAAC CTTTGCATTA GGAGCGATAT TACTGGTCGT GGAGTACTGG CTGATGGCAC 480

CACCAGATTT ACATGCAAGG GCAAACCAGT ACACCACTTC ATGAACACCA GTACATTTAC 540

CGAGTACACA GTGGTGGATG AATCTTCTGT TGCTAAGATT GATGATGCAG CTCCTCCTGA 600

GAAAGTCTGT TTAATTGGCT GTGGGTTTTC CACTGGATAT GGCGCTGCTG TTAAAACTGG 660

CAAGGTCAAA CCTGGTTCCA CTTGCGTCGT CTTTGGCCTG GGAGGAGTTG GCCTGTCAGT 720

CATCATGGGC TGTAAGTCAG CTGGTGCATC TAGGATCATT GGGATTGACC TCAACAAAGA 780

CAAATTTGAG AAGGCCATGG CTGTAGGTGC CACTGAGTGT ATCAGTCCCA AGGACTCTAC 840

CAAACCCATC AGTGAGGTGC TGTCAGAAAT GACAGGCAAC AACGTGGGAT ACACCTTTGA 900

AGTTATTGGG CATCTTGAAA CCATGATTGA TGCCCTGGCA TCCTGCCACA TGAACTATGG 960

GACCAGCGTG GTTGTAGGAG TTCCTCCATC AGCCAAGATG CTCACCTATG ACCCGATGTT 1020

GCTCTTCACT GGACGCACAT GGAAGGGATG TGTCTTTGGA GGTTTGAAAA GCAGAGATGA 1080

TGTCCCAAAA CTAGTGACTG AGTTCCTGGC AAAGAAATTT GACCTGGACC AGTTGATAAC 1140 

TCATGTTTTA CCATTTAAAA AAATCAGTGA AGGATTTGAG CTGCTCAATT CAGGACAAAG 1200

CATTCGAACG GTCCTGACGT TTTGAGATCC AAAGTGGCAG GAGGTCTGTG TTGTCATGGT 1260 

GAACTGGAGT TTCTCTTGTG AGAGTTCCCT CATCTGAAAT CATGTATCTG TCTCACAAAT 1320 

ACAAGCATAA GTAGAAGATT TGTTGAAGAC ATAGAACCCT TATAAAGAAT TATTAACCTT 1380

TATAAACATT TAAAGTCTTG TGAGCACCTG GGAATTAGTA TAATAACAAT GTTAATATTT 1440 

TTGATTTACA TTTTGTAAGG CTATAATTGT ATCTTTTAAG AAAACATACA CTTGGATTTC 1500 

TATGTTGAAA TGGAGATTTT TAAGAGTTTT AACCAGCTGC TGCAGATATA TAACTCAAAA 1560 

CAGATATAGC GTATAAAGAT ATAGTAAATG CATCTCCCAG AGTAATATTC ACTTAACACA 1620 

TTGAAACTAT TATTTTTTAG ATTTGAATAT AAATGTATTT TTTAAACACT TGTTATGAGT 1680 

TAACTTGGAT TACATTTTGA AATCAGTTCA TTCCATGATG CATATTACTG GATTAGATTA 1740 

AGAAAGACAG AAAAGATTAA GGGACGGGCA CATTTTTCAA CGATTAAGAA TCATCATTAC 1800 

ATAACTTGGT GAAACTGAAA AAGTATATCA TATGGGTACA CAAGGCTATT TGCCAGCATA 1860 

TATTAATATT TTAGAAAATA TTCCTTTTGT AATACTGAAT ATAAACATAG AGCTAGAGTC 1920 

ATATTATCAT ACTTATCATA ATGTTCAATT TGATACAGTA GAATTGCAAG TCCCTAAGTC 1980 

CCTATTCACT GTGCTTAGTA GTGACTCCAT TTAATAAAAA GTGTTTTTAG TTTTTAACAA 2040 
CTAAACCG 

Seq ID NO: 101 Protein sequence: 
Protein Accession #: NP_000664 

1 11 21 31 41 51 

| | | | | |

MGTAGKVIKC KAAVLWEQKQ PFSIEEIEVA PPKTKEVRIK ILATGICRTD DHVIKGTMVS 60 

KFPVIVGHEA TGIVESIGEG VTTVKPGDKV IPLFLPQCRE CNACRNPDGN LCIRSDITGR 120

GVLADGTTRF TCKGKPVHHF MNTSTFTEYT VVDESSVAKI DDAAPPEKVC LIGCGFSTGY 180

GAAVKTGKVK PGSTCVVFGL GGVGLSVIMG CKSAGASRII GIDLNKDKFE KAMAVGATEC 240

ISPKDSTKPI SEVLSEMTGN NVGYTFEVIG HLETMIDALA SCHMNYGTSV VVGVPPSAKM 300

LTYDPMLLFT GRTWKGCVFG GLKSRDDVPK LVTEFLAKKF DLDQLITHVL PFKKISEGFE 360 
LLNSGQSIRT VLTF 

Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession #: NM_006783.1

Coding sequence: 1-786

1 11 21 31 41 51

| | | | | |

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 

GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 

CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 

AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 

CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 

GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 420 

TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 

TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 

TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 

ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 

AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 

CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780
AGCTAA 

Seq ID NO: 103 Protein sequence: 
Protein Accession #: NP_006774.1 

1 11 21 31 41 51 

| | | | | |

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILVVAA QEVWGDEQED FVCNTLQPGC 60

KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120 

IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 

FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 
QNEMNELISD SGQNAITGFP S 

Seq ID NO: 104 DNA sequence 
Nucleic Acid Accession #: NM_020411 
Coding sequence: 86-526

1 11 21 31 41 51

| | | | | |

GGACCTGGGA AGGAGCATAG GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 

AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 

ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 

GGCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 

ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 

GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 

ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 

ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 

ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 

AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 
CAGCTTTCAC CAAAAAAAAA AAAAAA 



Seq ID NO: 105 Protein sequence: 
Protein Accession #: NP_065144.1 

1 11 21 31 41 51 
| | | | | |

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNHTAS 
PRSPVMESPK KKNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 

SGVKVKIIPK EEHCKMPEAG EEQPQV 

Seq ID NO: 106 DNA sequence 
Nucleic Acid Accession #: J04129 
Coding sequence: 99-587 

1 11 21 31 41 51 

| | | | | |

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 
TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 
AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 
ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 
CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 
AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 
TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 
AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 
AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 
GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 
CCAGGAAGAC CAGACTCCCA CCCTTCCACA GCTCCAGAGC AGTGGGACTT CCTCCTGCCC 
TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT
TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 



Seq ID NO: 107 Protein sequence: 
Protein Accession #: AAA60147 

1 11 21 31 41 51 

| | | | | |

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 48-794 

1 11 21 31 41 51 

| | | | | |

TCCCAGGCAG CAGTTAGCCG GCCGCCCGCC TGTGTGTCCC CAGAGCCATG GAGAGAGCCA 
GTCTGATCCA GAAGGCCAAG CTGGCAGAGC AGGCCGAACG CTATGAGGAC ATGGCAGCCT 
TCATGAAAGG CGCCGTGGAG AAGGGCGAGG AGCTCTCCTG CGAAGAGCGA AACCTGCTCT 
CAGTAGCCTA TAAGAACGTG GTGGGCGGCC AGAGGGCTGC CTGGAGGGTG CTGTCCAGTA 
TTGAGCAGAA AAGCAACGAG GAGGGCTCGG AGGAGAAGGG GCCCGAGGTG CGTGAGTACC 
GGGAGAAGGT GGAGACTGAG CTCCAGGGCG TGTGCGACAC CGTGCTGGGC CTGCTGGACA 
GCCACCTCAT CAAGGAGGCC GGGGACGCCG AGAGCCGGGT CTTCTACCTG AAGATGAAGG 
GTGACTACTA CCGCTACCTG GCCGAGGTGG CCACCGGTGA CGACAAGAAG CGCATCATTG 
ACTCAGCCCG GTCAGCCTAC CAGGAGGCCA TGGACATCAG CAAGAAGGAG ATGCCGCCCA 
CCAACCCCAT CCGCCTGGGC CTGGCCCTGA ACTTTTCCGT CTTCCACTAC GAGATCGCCA 
ACAGCCCCGA GGAGGCCATC TCTCTGGCCA AGACCACTTT CGACGAGGCC ATGGCTGATC 
TGCACACCCT CAGCGAGGAC TCCTACAAAG ACAGCACCCT CATCATGCAG CTGCTGCGAG 
ACAACCTGAC ACTGTGGACG GCCGACAACG CCGGGGAAGA GGGGGGCGAG GCTCCCCAGG 
AGCCCCAGAG CTGAGTGTTG CCCGCCACCG CCCCGCCCTG CCCCCTCCAG TCCCCCACCC 
TGCCGAGAGG ACTAGTATGG GGTGGGAGGC CCCACCCTTC TCCCCTAGGC GCTGTTCTTG 
CTCCAAAGGG CTCCGTGGAG AGGGACTGGC AGAGCTGAGG CCACCTGGGG CTGGGGATCC 
CACTCTTCTT GCAGCTGTTG AGCGCACCTA ACCACTGGTC ATGCCCCCAC CCCTGCTCTC 
CGCACCCGCT TCCTCCCGAC CCCAGGACCA GGCTACTTCT CCCCTCCTCT TGCCTCCCTC 
CTGCCCCTGC TGCCTCTGAT CGTAGGAATT GAGGAGTGTC CCGCCTTGTG GCTGAGAACT 
GGACAGTGGC AGGGGCTGGA GATGGGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG
CGCGCGCGCC AGTGCAAGAC CGAGATTGAG GGAAAGCATG TCTGCTGGGT GTGACCATGT
TTCCTCTCAA TAAAGTTCCC CTGTGACACT C 

Seq ID NO: 109 Protein sequence: 
Protein Accession #: NP_006133.1 

1 11 21 31 41 51 

| | | | | |

MERASLIQKA KLAEQAERYE DMAAFMKGAV EKGEELSCEE RNLLSVAYKN VVGGQRAAWR
VLSSIEQKSN EEGSEEKGPE VREYREKVET ELQGVCDTVL GLLDSHLIKE AGDAESRVFY
LKMKGDYYRY LAEVATGDDK KRIIDSARSA YQEAMDISKK EMPPTNPIRL GLALNFSVFH
YEIANSPEEA ISLAKTTFDE AMADLHTLSE DSYKDSTLIM QLLRDNLTLW TADNAGEEGG
EAPQEPQS 

Seq ID NO: 110 DNA sequence 
Nucleic Acid Accession #: NM_000695
Coding sequence: 407-1564

1 11 21 31 41 51

| | | | | |

CACGAGTTGG TTTGGGAGCT GCCAGTCTCC TGGGAGGATC GCAGTCAGCA GAGCAGGGCT 
GAGGCCTGGG GGTAGGAGCA GAGCCTGCGC ATCTGGAGGC AGCATGTCCA AGAAAGGGAG 
TGGAGGTGCA GCGAAGGACC CAGGGGCAGA GGCCACGCTG GGGATGGACC CCTTCGAGGA 
CACACTGCGG CGGCTGCGTG AGGCCTTCAA CTGAGGGCGC ACGCGGCCGG CCGAGTTCCG 
GGCTGCGCAG CTCCAGGGCG TGGGCCACTT CCTTCAAGAA AACAAGCAGC TTCTGCGCGA 
CGTGCTGGCC CAGGACCTGC ATAAGCCAGC TTTCGAGGCA GACATATCTG AGCTCATCCT 360

TTGCCAGAAC GAGGTTGACT ACGCTCTCAA GAACCTTCAG GCCTGGATGA AGGATGAACC 420

ACGGTCCACG AACCTGTTCA TGAAGCTGGA CTCGGTCTTC ATCTGGAAGG AACCCTTTGG 480

CCTGGTCCTC ATCATCGCAC CCTGGAACTA CCCATTGAAC CTGACCCTGG TGCTCCTGGT 540 

GGGCACCCTC CCCGCAGGGA ATTGCGTGGT GCTGAAGCCG TCAGAAATCA GCCAGGGCAC 600 

AGAGAAGGTC CTGGCTGAGG TGCTGCCCCA GTACCTGGAC CAGAGCTGCT TTGCCGTGGT 660 

GCTGGGCGGA CCCCAGGAGA CAGGGCAGCT GCTAGAGCAC AAGTTGGACT ACATCTTCTT 720 

CACAGGGAGC CCTCGTGTGG GCAAGATTGT CATGACTGCT GCCACCAAGC ACCTGACGCC 780 

TGTCACCCTG GAGCTGGGGG GCAAGAACCC CTGCTACGTG GACGACAACT GCGACCCCCA 840

GACCGTGGCC AACCGCGTGG CCTGGTTCTG CTACTTCAAT GCCGGCCAGA CCTGCGTGGC 900

CCCTGACTAC GTCCTGTGCA GCCCCGAGAT GCAGGAGAGG CTGCTGCCCG CCCTGCAGAG 960 

CACCATCACC CGTTTCTATG GCGACGACCC CCAGAGCTCC CCAAACCTGG GCCGCATCAT 1020 

CAACCAGAAA CAGTTCCAGC GGCTGCGGGC ATTGCTGGGC TGCGGCCGCG TGGCCATTGG 1080 

GGGCCAGAGC AACGAGAGCG ATCGCTACAT CGCCCCCACG GTGCTGGTGG ACGTGCAGGA 1140

GACGGAGCCT GTGATGCAGG AGGAGATCTT CGGGCCCATC CTGCCCATCG TGAACGTGCA 1200 

GAGCGTGGAC GAGGCCATCA AGTTCATCAA CCGGCAGGAG AAGCCCCTGG CCCTGTACGC 1260

CTTCTCCAAC AGCAGACAGG TTGTGAACCA GATGCTGGAG CGGACCAGCA GCGGCAGCTT 1320 

TGGAGGCAAT GAGGGCTTCA CCTACATATC TCTGCTGTCC GTGCCATTCG GGGGAGTCGG 1380 

CCACAGTGGG ATGGGCCGGT ACCACGGCAA GTTCACCTTC GACACCTTCT CCCACCACCG 1440 

CACCTGCCTG CTCGCCCCCT CCGGCCTGGA GAAATTAAAG GAGATCCGCT ACCCACCCTA 1500 

TACCGACTGG AACCAGCAGC TGTTACGCTG GGGCATGGGC TCCCAGAGCT GCACCCTCCT 1560 

GTGAGCGTCC CACCCGCCTC CAACGGGTCA CACAGAGAAA CCTGAGTCTA GCCATGAGGG 1620

GCTTATGCTC CCAACTCACA TTGTTCCTCC AGACCGCAGG CTCCCCCAGC CTCAGGTTGC 1680 

TGGAGCTGTC ACATGACTGC ATCCTGCCTG CCAGGGCTGC AAAGCAAGGT CTTGCTTCTA 1740 

TCTGGGGGAC GCTGCTCGAG AGAGGCGGAG AGGCCGCAGA ACATGCCAGG TGTCCTCACT 1800 

CACCCCACCC TCCCCAATTC CAGCCCTTTG CCCTCTCGGT CAGGGTTGGC CAGGCCCAGT 1860 

CACAGGGGCA GTGTCACCCT GGAAAATACA GTGCCCTGCC TTCTTAGGGG CATCAGCCCT 1920 

GAACGGTTGA GAGCGTGGAG CCCTCCAGGC CTTTGCTCTC CCCTCTAGGC ACACGCGCAC 1980 

TTCCACCTCT GCCCCATCCC AACTGCACCA GCACTGCCTC CCCCAGGGAT CCTCTCACAT 2040

CCCACACTGG TCTCTGCACC ACCCCTCTGG TTCACACCGC ACCCTGCACT CACCCACAGC 2100 

AGCTCCATCC ACTGGGAAAA CTGGGGTTTG CATCACTCCA CTGCACAGTG TTAGTGGGAC 2160 

CTGGGGGCAA GTCCCTTGAC TTCTCTGAGC CTCAGTTTCC TTATGTGT^AA GTTGCTGGAA 2220 

CCAAAATGGA GTCACTTATG CCAAACTCTA ATAAAATGGA GTCGGGGGGG CACATAGAAG 2280 

CCCTCACACA CACATGCCCG TAACAGGATT TATCACCAAG ACACGCCTGC ATGTAAGACC 2340

AGACACAGGG CGTATGGAAA AGCACGTCCT CAAAGACTGT AGTATTCCAG ATGAGCTGCA 2400

GATGCTTACC TACCACGGCC GTCTCCACCA GAAAACCATC GCCAACTCCT GCGATCAGCT 2460

TGTGACTTAC AAACCTTGTT TAAAAGCTGC TTACATGGAC TTCTGTCCTT TAAAACGTTC 2520 

CCCTTGGCTG TGGCCCTCTG TGTATGCCTG GGATCCTTCC AAGCACTCAT AGCCCAGATA 2580 
GGAATCCTCT GCTCCTCCCA AATAAATTCA TCTGTTC 
Seq ID NO: 111 Protein sequence:
Protein Accession #: NP_000686

1 11 21 31 41 51
| | | | | |
MKDEPRSTNL FMKLDSVFIW KEPFGLVLII APWNYPLNLT LVLLVGTLPA GNCVVLKPSE 60
ISQGTEKVLA EVLPQYLDQS CFAVVLGGPQ ETGQLLEHKL DYIFFTGSPR VGKIVMTAAT 120
KHLTPVTLEL GGKNPCYVDD NCDPQTVANR VAWFCYFNAG QTCVAPDYVL CSPEMQERLL 180
PALQSTITRF YGDDPQSSPN LGRIINQKQF QRLRALLGCG RVAIGGQSNE SDRYIAPTVL 240
VDVQETEPVM QEEIFGPILP IVNVQSVDEA IKFINRQEKP LALYAFSNSR QVVNQMLERT 300
SSGSFGGNEG FTYISLLSVP FGGVGHSGMG RYHGKFTFDT FSHHRTCLLA PSGLEKLKEI 360
RYPPYTDWNQ QLLRWGMGSQ SCTLL



Seq ID NO: 112 DNA sequence 
Nucleic Acid Accession #: NM_004456
Coding sequence: 58-2298 



1 11 21 31 41 51
| | | | | |
GAATTCCGGG CGACGCGCGG GAACAACGCG AGTCGGCGCG CGGGACGAAG AATAATCATG 60
GGCCAGACTG GGAAGAAATC TGAGAAGGGA CCAGTTTGTT GGCGGAAGCG TGTAAAATCA 120
GAGTACATGC GACTGAGACA GCTCAAGAGG TTCAGACGAG CTGATGAAGT AAAGAGTATG 180
TTTAGTTCCA ATCGTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240
CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGCG CGGGACTAGG 300
GAGTGTTCGG TGACCAGTGA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 360
AATGCAGTTG CTTCAGTACC CATAATGTAT TCTTGGTCTC CCCTACAGCA GAATTTTATG 420
GTGGAAGATG AAACTGTTTT ACATAACATT CCTTATATGG GAGATGAAGT TTTAGATCAG 480
GATGGTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA CGGGGATAGA 540
GAATGTGGGT TTATAAATGA TGAAATTTTT GTGGAGTTGG TGAATGCCCT TGGTCAATAT 600
AATGATGATG ACGATGATGA TGATGGAGAC GATCCTGAAG AAAGAGAAGA AAAGCAGAAA 660
GATCTGGAGG ATCACCGAGA TGATAAAGAA AGCCGCCCAC CTCGGAAATT TCCTTCTGAT 720
AAAATTTTGG AGGCCATTTC CTCAATGTTT CCAGATAAGG GCACAGCAGA AGAACTAAAG 780
GAAAAATATA AAGAACTCAC CGAACAGCAG CTCCCAGGCG CACTTCCTCC TGAATGTACC 840
CCCAACATAG ATGGACCAAA TGCTAAATCT GTTCAGAGAG AGCAAAGCTT ACACTCCTTT 900
CATACGCTTT TCTGTAGGCG ATGTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960
ACACCCAACA CTTATAAGCG GAAGAACACA GAAACAGCTC TAGACAACAA ACCTTGTGGA 1020
CCACAGTGTT ACCAGCATTT GGAGGGAGCA AAGGAGTTTG CTGCTGCTCT CACCGCTGAG 1080
CGGATAAAGA CCCCACCAAA ACGTGCAGGA GGCCGCAGAA GAGGACGGCT TCCCAATAAC 1140
AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTGGAAT CAAAGGATAC AGACAGTGAT 1200
AGGGAAGCAG GGACTGAAAC GGGGGGAGAG AACAATGATA AAGAAGAAGA AGAGAAGAAA 1260
GATGAAACTT CGAGCTCCTC TGAAGCAAAT TCTCGGTGTC AAACACCAAT AAAGATGAAG 1320
CCAAATATTG AACCTCCTGA GAATGTGGAG TGGAGTGGTG CTGAAGCCTC AATGTTTAGA 1380
GTCCTCATTG GCACTTACTA TGACAATTTC TGTGCCATTG CTAGGTTAAT TGGGACCAAR. 1440
ACATGTAGAC AGGTGTATGA GTTTAGAGTC AAAGAATCTA GCATCATAGC TCCAGCTCCC 1500
GCTGAGGATG TGGATACTCC TCCAAGGAAA AAGAAGAGGA AACACCGGTT GTGGGCTGCA 1560
CACTGCAGAA AGATACAGCT GAAAAAGGAC GGCTCCTCTA ACCATGTTTA CAACTATCAA 1620
CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT
TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC GGGATGCCGC
TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC
CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC
AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG
GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC
TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC
ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT
AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG
GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG
CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA
GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG
CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG
AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT
GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA



Seq ID NO: 113 Protein sequence:
Protein Accession #: NP_004447 

1 11 21 31 41 51 

I I I I I I 

MGQTGKKSEK GPVCWRKRVK SEYMRLRQLK RFRRADEVKS MFSSNRQKIL ERTEILNQEW 

KQRRIQPVHI LTSVSSLRGT RECSVTSDLD FPTQVIPLKT LNAVASVPIM YSWSPLQQNF 

MVEDETVIiHN IPYMGDEVLD QDGTFIEELI KNVDGKVHGD RECX3FINDEI FVEIiVNALGQ 

YNDDDDDDDG DDPEEREEKQ KDLEDHRDDK ESRPPRKFPS DKIIjEAISSM FPDKGTAEBL 

kekykelteq qlpgalppec tpnidgpnak svqreqslhs fhtlfcrrcf kydcflhpfh 
atpntykrkn tetaldnkpc gpqcyqhleg akeft^^lta eriktppkrp ggrrrgrlpn 
nssrpstpti nvleskdtds dreagtetgg enndkeeeek kdetssssea nsrcqtpikm 
kpnieppenv ewsgaeasmf rvligtyydn fcaiarligt ktcrqvybfr vkessiiapa 
PAEDVDTPPR kkkrkhrlwa ahcrkiqlkk dgssnhvyny qpcdhprqpc dsscpcviaq 
nfcekfcqcs secqnrfpgc rckaqcntkq cpcylavrec dpdlcltcga adhwdsknvs 
ckncsiqrgs kkhlllapsd vagwgifikd pvqknefise ycgeiisqde adrrgkvydk 
ymcsflfnln ndfwdatrk gnkirft^nhs vnpncyakvm mvngdhrigi fakraiqtge 

ELFVDYRYSQ ADALKYVGIE REMEIP 

Seq ID NO: 114 DNA sequence
Nucleic Acid Accession #: NM_001827 
Coding sequence: 96-335 

1 11 21 31 41 51 

I I I I I I 

AGTCTCCGGC GAGTTGTTGC CTGGGCTGGA CGTGGTTTTG TCTGCTGCGC CCGCTCTTCG 
CGCTCTCGTT TCATTTTCTG CAGCGCGCCA CGAGGATGGC CCACAAGCAG ATCTACTACT 
CGGACAAGTA CTTCGACGAA CACTACGAGT ACCGGCATGT TATGTTACCC AGAGAACTTT 
CCAAACAAGT ACCTAAAACT CATCTGATGT CTGAAGAGGA GTGGAGGAGA CTTGGTGTCC
AACAGAGTCT AGGCTGGGTT CATTACATGA TTCATGAGCC AGAACCACAT ATTCTTCTCT 
TTAGACGACC TCTTCCAAAA GATCAACAAA AATGAAGTTT ATCTGGGGAT CGTCAAATCT 
TTTTCAAATT TAATGTATAT GTGTATATAA GGTAGTATTC AGTGAATACT TGAGAAATGT 
ACAAATCTTT CATCCATACC TGTGCATGAG CTGTATTCTT CACAGCAACA GAGCTCAGTT 
AAATGCAACT GCAAGTAGGT TACTGTAAGA TGTTTAAGAT AAAAGTTCTT CCAGTCAGTT 
TTTCTCTTAA GTGCCTGTTT GAGTTTACTG AAACAGTTTA CTTTTGTTCA ATAAAGTTTG 
TATGTTGCAT TTAAAAAAAA AAAAAAA 

Seq ID NO: 115 Protein sequence:
Protein Accession #: NP_001818 

1 11 21 31 41 51 

I I I I I I

MAHKQIYYSD KYFDEHYEYR HVMLPRELSK QVPKTHLMSE EEWRRLGVQQ SLGWVHYMIH
EPEPHILLFR RPLPKDQQK
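
The "Coding sequence" range given with each DNA entry can be checked against the protein entry that follows it. The sketch below is illustrative only and is not part of the listing: it assumes the range is 1-based and inclusive, that the standard genetic code applies, and that the helper names (`clean_listing`, `extract_cds`, `translate`) are hypothetical. It takes the first two rows of Seq ID NO: 114 (positions 1-120) and translates the part of the 96-335 coding range that falls inside them.

```python
# Standard genetic code, packed in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_listing(text):
    """Keep only A/C/G/T letters, dropping spaces, newlines and position numbers.

    Assumption: the rows are free of OCR damage; a garbled base would be
    dropped here and shift every later position by one.
    """
    return "".join(ch for ch in text.upper() if ch in "ACGT")


def extract_cds(text, start, end):
    """Return bases start..end (1-based, inclusive) of a blocked listing."""
    return clean_listing(text)[start - 1:end]


def translate(cds):
    """Translate complete codons only, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Rows 1-2 (positions 1-120) of Seq ID NO: 114 as printed in the listing.
ROWS = """
AGTCTCCGGC GAGTTGTTGC CTGGGCTGGA CGTGGTTTTG TCTGCTGCGC CCGCTCTTCG
CGCTCTCGTT TCATTTTCTG CAGCGCGCCA CGAGGATGGC CCACAAGCAG ATCTACTACT
"""

# Positions 96-120 are the start of the 96-335 coding range.
print(translate(extract_cds(ROWS, 96, 120)))  # prints MAHKQIYY
```

The printed residues agree with the first eight residues of Seq ID NO: 115 (MAHKQIYYSD ...), which is consistent with the coding range starting at the ATG at position 96.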

Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

I I I I I I 

TCAGACCTCA TGAGTCACTT GGACTCTTGA GCCACCTCTG GGGGTGGAGT CTCTCTCCTG
GCATCTGGAC CCTTGGTGCT ATCGACGAAG CTTGGGTGGG GCTCTTAGCT GCTATGTGCA
AGAGGTGTGT TCCAGGGAAA GCCCCTATCT CTCTGCAGAG GTCAAGTGAA AGCGACGGCC
GCAGCCAACA GAGTTCAAAA TGCAGGCTTG GAAAGTACAG GGGGCTCTGT GGAGGATGGG
AAGGACTGAT CCACATTCCC ACCAGGAAGT TTAGCAGAAC CCCCGCGTGC CAACTGGACC
CCTTGGAAGG ACCTGGCTCA GGCTGGACCA CCTCTTGAGA GGGAGGAGCT CTGGATTTGA
TCAAGAATTC TTTGCTGAGC ATGGTGCCTC ATGCCTATAA TACCAACACT TTGGGAGGCC

AGTGTGGGAG GATCTCTTGA GCCCAGGAGT TCAAGACTAG CCTGGGCAAC ACAGAGAGAA 
CCCATCTCTA AAATAATAAT AATAATAAAA TTiAAAAATTA GCAGGGCATG GTGGCATGTG 
CCTGTAGTTC CAGCTACCCA GGAGGCTGAG GCAAGAGGAT GGCTGGAGCC TGGGATGTTG 
AGGCTGCAAT GAACTGTGAT TACCCCACTG CACTCCAGCC TGGGCAAAAG AGCGAGAGAA 
CCTGTCTCAA ATAATAATAA TAATAATAAT CTTATTTTGG AGAATAAAGA GACCTCTGGA 
TTTGAGGTGC CATTTGGGTA G7UVAGAAAAG ACGTTTACAC CGAGAAATAG TCTGTGTTGC 
CCTGAAGGAG CAGAGGGATG CATCGCTGGA GGTGACCTAC AGTTGAAGAA GACTCATTAT 
GACAGACCTT GTCCTTCTTC CTTGTGGAAA GTGTTTCCTC TGCTGCTACT GCTCATGAGA 
CTCTTCCCCC TCCCTGTCCC AGGGAACCAA AGGGCTTTCT ACCACACCCT TTCTTGCCCC 
CCGCCTCCCA TGTCTGCTGT GCCTTTGTAC TCAGCAATTC TTGTTTGCTC CATTATCTTC 
CAGCCGGATA CAGAGTGAAT AGTTAACCAC ACTTAGGTCA AATAGGATCT AAATTTTTGT 
TCCTGCTCCG TGTAT^GAGG CCAGTGTTTG TGTGTTGCAA GCAGCCTTGG AATAGTAACT
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CTTCTCATTT GTTTGGGATC TGGCCACCAA GTTCCAGAAT GATACACGGA TCAGTGCAGA 1200 

AGTTCATCAG GCTCTCGGAC CTTAGGGCTG TTGGAGAAGG CTTCAGCAGC AGAACTGATG 1260

GTGAAGGCTC GTGTTCTCCA TCCTCAACTT TCTTTGCTTC GATCATACAC AAGAATACAT 1320 

TTGGAAGGGC AA7W\ATGAA CACTGTCGTT CATTGCAGCC GTGTTTTGTG ACACAGATGC 1380

ACAGTCTGCT GTGAAGACCT TCTCTCAAGT GGCATTTGGG AGTCCATGCC AGATCATGGT 1440 

GCTTCATGAG AGACTGACAG CTATCAGGGG TTGTGGCACT TAGTGAGGAC TCTCCTCCCC 1500 

CAGTGTGTGC TGATGACACA TACACACCTG ACAATAGCTT GAGTCTTCTC TGTTCCTTTT 1560 

ACTCTGTAGC CAACATACAC ATGATTTAAA ACCCTTTCTA AATATCTATC ATGGTTCATC 1620 

CTTGTCCAAA TGCAGAGTCA GAGCTATTTG TACTTCATTA TTATTTCCAA GGCGAATAGT 1680

TGGCTTTCTT TTTGCAA7VAA TAATTAAAGT TTTTGTATGT TGCAAAAAAA AAAAAAAAAA 1740 
AAACAAAAAA 



PCT/US02/12476 






Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession #: BC012178.1

Coding sequence: 204-2285 



1 11 21 31 41 51

I I I I I I

CTTCTCTCCC GCGGCGCTGG GGCCCGCGCT CCGCTGCTGT TGCTCCATTC GGCGCTTTTC 60
TGGCGGCTGG CTCCTCTCCG CTGCCGGCTG CTCCTCGACC AGGCCTCCTT CTCAACCTCA 120
GCCCGCGGCG CCGACCCTTC CGGCACCCTC CCGCCCCGTC TCGTACTGTC GCCGTCACCG 180
CCGCGGCTCC GGCCCTGGCC CCGATGGCTC TGTGCAACGG AGACTCCAAG CTGGAGAATG 240
CTGGAGGAGA CCTTAAGGAT GGCCACCACC ACTATGAAGG AGCTGTTGTC ATTCTGGATG 300
CTGGTGCTCA GTACGGGAAA GTCATAGACC GAAGAGTGAG GGAACTGTTC GTGCAGTCTG 360
AAATTTTCCC CTTGGAAACA CCAGCATTTG CTATAAAGGA ACAAGGATTC CGTGCTATTA 420
TCATCTCTGG AGGACCTAAT TCTGTGTATG CTGAAGATGC TCCCTGGTTT GATCCAGCAA 480
TATTCACTAT TGGCAAGCCT GTTCTTGGAA TTTGCTATGG TATGCAGATG ATGAATAAGG 540
TATTTGGAGG TACTGTGCAC AAAAAAAGTG TCAGAGAAGA TGGAGTTTTC AACATTAGTG 600
TGGATAATAC ATGTTCATTA TTCAGGGGCC TTCAGAAGGA AGAAGTTGTT TTGCTTACAC 660
ATGGAGATAG TGTAGACAAA GTAGCTGATG GATTCAAGGT TGTGGCACGT TCTGGAAACA 720
TAGTAGCAGG CATAGCAAAT GAATCTAAAA AGTTATATGG AGCACAGTTC CACCCTGAAG 780
TTGGCCTTAC AGAAAATGGA TU^GTAATAC TGAAGAATTT CCTTTATGAT ATAGCTGGAT 840
GCAGTGGAAC CTTCACCGTG CAGAACAGAG AACTTGAGTG TATTCGAGAG ATCAAAGAGA 900
GAGTAGGCAC GTCAAT^GTT TTGGTTTTAC TCAGTGGTGG AGTAGACTCA ACAGTTTGTA 960
CAGCTTTGCT AAATCGTGCT TTGAACCAAG AACAAGTCAT TGCTGTGCAC ATTGATAATG 1020
GCTTTATGAG AAAACGAGAA AGCCAGTCTG TTGAAGAGGC CCTCTiAAAAG CTTGGAATTC 1080
AGGTCAAAGT GATAAATGCT GCTCATTCTT TCTACAATGG AACAACAACC CTACCAATAT 1140
CAGATGAAGA TAGAACCCCA CGGAAAAGAA TTAGCAAAAC GTTAAATATG ACCACAAGTC 1200
CTGAAGAGAA AAGAAAAATC ATTGGGGATA CTTTTGTTAA GATTGCCAAT GAAGTAATTG 1260
GAGAAATGAA CTTGAAACCA GAGGAGGTTT TCCTTGCCCA AGGTACTTTA CGGCCTGATC 1320
TAATTGAAAG TGCATCCCTT GTTGCAAGTG GCAAAGCTGA ACTCATCAAA ACCCATCACA 1380
ATGACACAGA GCTCATCAGA AAGTTGAGAG AGGAGGGAAA AGTAATAGAA CCTCTGAAAG 1440
ATTTTCATAA AGATGAAGTG AGT^TTTTGG GCAGAGAACT TGGACTTCCA GAAGAGTTAG 1500
TTTCCAGGCA TCCATTTCCA GGTCCTGGCC TGGCAATCAG AGTAATATGT GCTGAAGAAC 1560
CTTATATTTG TAAGGACTTT CCTGAAACCA ACAATATTTT GAAAATAGTA GCTGATTTTT 1620
CTGCAAGTGT TAAAAAGCCA CATACCCTAT TACAGAGAGT CAAAGCCTGC ACAACAGAAG 1680
AGGATCAGGA GAAGCTGATG CAAATTACCA GTCTGCATTC ACTGAATGCC TTCTTGCTGC 1740
CAATTAAAAC TGTAGGTGTG CAGGGTGACT GTCGTTCCTA CAGTTACGTG TGTGGAATCT 1800
CCAGTAAAGA TGAACCTGAC TGGGAATCAC TTATTTTTCT GGCTAGGCTT ATACCTCGCA 1860
TGTGTCACAA CGTTAACAGA GTTGTTTATA TATTTGGCCC ACCAGTTAAA GAACCTCCTA 1920
CAGATGTTAC TCCCACTTTC TTGACAACAG GGGTGCTCAG TACTTTACGC CAAGCTGATT 1980
TTGAGGCCCA TAACATTCTC AGGGAGTCTG GGTATGGTGG GAAAATCAGC CAGATGCCGG 2040
TGATTTTGAC ACCATTACAT TTTGATCGGG ACCCACTTCA AAAGCAGCCT TCATGCCAGA 2100
GATCTGTGGT TATTCGAACC TTTATTACTA GTGACTTCAT GACTGGTATA CCTGCAACAC 2160
CTGGCAATGA GATCCCTGTA GAGGTGGTAT T/iAAGATGGT CACTGAGATT AAGAAGATTC 2220
CTGGTATTTC TCGAATTATG TATGACTTAA CATCAAAGCC CCCAGGAACT ACTGAGTGGG 2280
AGTAATAAAC TTCTTGTTCT ATTAAAA






Seq ID NO: 118 Protein sequence: 
Protein Accession #: AAH12178.1 



1 11 21 31 41 51

I I I I I I

MALCNGDSKIj ENAGGDLKDG HHHYEGAWI LDAGAQYGKV IDRRVRELFV QSEIFPLETP 60
APAIKEQGFR AIIISGGPNS VYAEDAPWFD PAIFTIGKPV LGICYGMQMM NKVFGGTVHK 120
KSVREDGVPN ISVDNTCSLP RGLQKEBWIj LTHGDSVDKV ADGFKWARS GNIVAGIANB 180
SKKLYGAQFH PEVGLTENGK VILKNFLYDI AGCSGTFTVQ NRELECIRBI KERVGTSKVL 240
VLLSGGVDST VCTALLNRAL NQEQVIAVHI DNGFMRKRES QSVEEALKKL GIQVKVINAA 300
HSFYNGTTTL PISDEDRTPR KRISKTLNMT TSPEEKRKII GDTFVKIANE VIGEMNLKPE 360
EVFLAQGTLR PDLIESASLV ASGKAELIKT HHITDTELIRK LREEGKVIEP LKDPHKDEVR 420
ILGRELGLPE ELVSRHPFPG PGLAIRVICA EEPYICKDFP ETNNILKIVA DFSASVKKPH 480
TLIjQRVKACT TEEDQEKLMQ ITSIiHSIjNAF LLPIKTVGVQ GDCRSYSYVC GISSKDEPDH 540
BSLIFLARLI PRMCHNVNRV VYIFGPPVKE PPTDVTPTFIj TT6VLSTLRQ ADFEAHNIIiR 600
ESGYAGKISQ MPVILTPLHF DRDPLQKQPS CQRSWIRTF ITSDFMTGIP ATPGNEIPVE 660
WriKMVTEIK KIPGISRIMY DLTSKPPGTT EWE



Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27..1967

1 11 21 31 41 51

I I I I I I

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 
TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180
AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 
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TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360

GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTAC/U^ ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP_006491.1 

1 11 21 31 41 51 

I I I I I I

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 

DWFSVHKEKR TIiIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPEEPN IQVNPIiGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360

LTLTCEABSS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

QLVKLAIF6P PWMAFKBRKV WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 

LSTLNVLVTP ELLBTGVECT ASNDLGKNTS ILFIiELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRT^STSTER KLPBPBSRGV VIVAVIVCIIi VIiAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELW EVKSDKLPEE MGIiLQGSSGD KRAPGDQGEK YIDLRH 
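
The coding range of a DNA entry also fixes how long the paired protein entry should be, which gives a quick check for residues lost or duplicated in a printed sequence. The sketch below is illustrative only: it assumes the range is 1-based, inclusive and ends on the stop codon (which holds for the entries checked here), and the function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(cds_start, cds_end):
    """Number of residues encoded by an inclusive coding range.

    Assumption: the range ends with the stop codon, so one codon is
    subtracted from the total.
    """
    n_bases = cds_end - cds_start + 1
    if n_bases % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    return n_bases // 3 - 1


# Seq ID NO: 119, "Coding sequence: 27..1967"
print(expected_protein_length(27, 1967))  # prints 646

# Seq ID NO: 114, "Coding sequence: 96-335"
print(expected_protein_length(96, 335))   # prints 79
```

For Seq ID NO: 119 the result, 646, matches the paired protein entry Seq ID NO: 120, whose last numbered row ends at residue 600 and is followed by a final row of 46 residues. For Seq ID NO: 114 the result, 79, matches the 60 + 19 residues printed for Seq ID NO: 115.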

Seq ID NO: 121 DNA sequence 
Nucleic Acid Accession #: NM_018306 
Coding sequence: 60-671 

1 11 21 31 41 51 

I I I I I I 

ATAGTCTACA CAGAGCTCCC CTTGCTGCCC AGACAAGCTG AAGGACCACA GGAAAAGCCA 60 

TGGAGACTTC AGCATCCTCC TCCCAGCCTC AGGACAACAG TCAAGTCCAC AGAGAAACAG 120

AAGATGTAGA CTATGGAGAG ACAGATTTCC ACAAGCAAGA CGGGAAGGCT GGACTCTTTT 180 

CCCAAGAACA ATATGAGAGA AACAAGTCTT CTTCCTCCTC CTTCTCTTCC TCCTCATCCT 240 

CCTCATCTTC TTCATCCTCC TCCTCCTCAG GTCCTGGGCA TGGGGAGCCT GACGTTTTGA 300 
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AGGATGAGCT TCAACTCTAT GGAGATGCTC CTG6AGAGGT GGTACCCTCT GGGGAATCAG 360 

GACTCCGAAG GAGAGGCTCT GACCCAGCAA GTGGAGAAGT GGAGGCCTCT CAGTTAAGAA 420 

GACTGAATAT AAAGAAAGAT GATGAGTTTT TCCATTTCGT CCTCCTGTGC TTTGCCATCG 480 

GGGCCTTGCT GGTGTGTTAT CACTATTACG CAGACTGGTT CATGTCTCTT GGGGTCGGCC 540 

TGCTCACCTT CGCCTCCCTG GAAACCGTTG GCATCTACTT CGGACTAGTG TACCGTATCC 600

ACAGCGTCCT CCAAGGCTTC ATCCCCCTCT TCCAGAAGTT TAGGCTGACA GGGTTCAGGA 660 

AGACTGACTG AGGCCACTTC CAGGTGGGCA GCAGAGGCAG GCCCCAGTGT GACCACCACT 720 

GCGACCCCTG AGCCCACAAG GGCAGAGCAG CATTCTGAGA GACGCACAGG AGACCAAGCC 780 

AGACCAATAA ACAGAACACT TTTCCTTCCA TGTGGTCTGA ATGTTGGCAC CAGCCCGGGC 840 

AGGGGCATCT CATTTGGGCA GTACTGCTGT GCAACCCAGC TGCAAGGATG GAAGGCAGAG 900

GGTGGGTGTG GGGCCTGAGG CTTCACAGTA CCTGGACCAG CAGGAAGATT CTGGGAGGTC 960 

ACTGCTCTCA GAGGACAGCA AGGGACCCTG AGCTCTGCAA GCTGTGATCT GTCTGGGTTC 1020 

ATGGTTTTTC TCAAATCCCA GGCTATCTGC ATGCGCTCTC AGGTGCTACC GAGCCATCCT 1080 

GGGAGAGATG GATGGTCCAC TGCTTTGAGG CAGGGAGCCA TCGGGCTGGG GCCCCTTGGT 1140 

GAACCTGATG CAGGTAAGAT GCTGAGGACT AAAACCATTT TTTTTGCACC CAAAAAAAAA 1200

GGCAGGAAAA TGATCATCAG AAACTAAATG GCAGCCAGGC ATGGGGGCTC ACGACTGTAA 1260 

TCCTCGCACT TTGGGAGGCT CAGGCTAAGG GTCGCTTGAA GCTGAGAGTT CAAGACCAAC 1320 

CTGGGCAACA TAGTGAGACC CCCATCTCTA CAATTTTTTT TTAATGACCA AATGTGGCGG 1380 

TACATACCTG TACATACCTG CGGTTCCAGC TACTCAAGAG GCTGAGGCAG GAGGACTGCT 1440

TGAGCCCAGG AGTTCAGGGC TGCAGTGAGG TACGATCAAG CCACTGCACT CCAGCCTGGG 1500
CGACAGAGCA AGATCGTTTC TCTAAAATT 
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25 



Seq ID NO: 122 Protein sequence: 
Protein Accession #: NP_060776 

1 11 21 31 41 51 

I I I I I I 

METSASSSQP QDNSQVHRET EDVDYGETDF HKQDGKAGLF SQEQYERNKS SSSSFSSSSS 
SSSSSSSSSS GPGHGEPDVL KDELQLYGDA PGEWPSGES GLRRRGSDPA SGEVEASQLR 
RLNIKKDDEF FHFVLLCFAI GAIjIiVCYHYY ADWFMSLGVG LLTFASLETV GIYFGLVYRI 
HSVLQGFIPIi FQKFRLTGFR KTD 

Seq ID NO: 123 DNA sequence 
Nucleic Acid Accession #: BC022542 
Coding sequence: 243..896

1 11 21 31 41 51 

I I I I I I 

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 
CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 
TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 
GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 
CCATGTGTTC TGT^AATTATT TTGAGGCAAG AAGTTTTGTWV AGATGGTTTC CACAGAGACC 
TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 
TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 
AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 
ATTTGTCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 
GTTTTC7VAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 
AAGCCTCGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 
TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 
TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 
CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 
TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 
TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 
TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 
TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 
CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 
TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 
GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACM ATTGAGAAAA 
CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 
TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 
ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 
CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 
GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 
GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 
ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 
CTAAGi\AATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 
ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 
GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 
TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 



Seq ID NO: 124 Protein sequence: 
Protein Accession #: AAH22542 



1 11 21 31 41 51

I I I I I I

MCSEIILRQE VLKDGFHRDL IjIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 
ASIWNNPDL LMFCDQAGSR RMIRFRPDSF DKTIEFPILK CWAHSEVAAP CADENEDICQ 
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTIiLITILCS KKKKK 

Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_004994.1

Coding sequence: 20..2143



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WO 02/086443 

1 11 21 31 41 51

I I I I I I

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60
GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120
CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180
CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240
CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300
GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360
CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420
GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480
CACCTTCAGT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540
GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600
TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660
GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720
CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780
CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840
GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900
CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960
CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020
CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080
GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140
TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200
TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260
GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320
CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380
AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440
TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500
AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560
TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620
CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680
CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740
GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800
GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920
GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980
CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040
CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100
GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160
GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT

Seq ID NO: 126 Protein sequence:
Protein Accession #: NP_004985.1

1 11 21 31 41 51

I I I I I I

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNIiT DRQLAEEYLY RYGYTRVAEM 60
RGESKSLGPA LLIiLQKQLSL PETGELDSAT LKiywiRTPRCG VPDLGRFQTF EGDLKWHHHN 120
ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180
FDGKDGLLAH AFPPGPGIQG DTVHFDDDELW SLGKGVWPT RFGN7UDGAAC HFPFIFEGRS 240
YSACTTDGRS DGIiPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300
ACTTDGRSDG YRWCATTANY DRDKLFGFCP TR7U3STVMGG NSAGELCVFP FTFLGKEYST 360
CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEAIiMY 420
PMYRFTEGPP liHKDDVNGIR HLYGPRPEPB PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480
PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ IiYIjFKDGKYW 540
RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600
LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660
THDVFQYREK AYPCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED

Seq ID NO: 127 DNA sequence
Nucleic Acid Accession #: NM_004181
Coding sequence: 32-670

1 11 21 31 41 51

I I I I I I

GCAGAAATAG CCTAGGGAGA TCAACCCCGA GATGCTGAAC AAAGTGCTGT CCCGGCTGGG 60
GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120
GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180
CTTCAGGAAA AAGCAGATTG AAGAGCTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240
CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300
TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360
AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420
AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 480
TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 540
TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG 600
CAGAGAATTC ACCGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660
GGCAGCCTAA TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720
AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780
TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 840
ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900
TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 960
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TAGC





Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP_004172 
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51 
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1 



1 11 21 31 41 51



MLNKVLSRLG VAGQWRFVDV LGLEEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60 

GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQFLSET EKMSPEDRAK 120 

CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILPNNVDGH LYELDGRMPF PVNHGASSED 180
TLLKDAAKVC REPTEREQGE VRFSAVALCK AA 



Seq ID NO: 129 DNA sequence 
Nucleic Acid Accession #: NM_000213
Coding sequence: 127-5385 

1 11 21 31 41 51 

I I I I I I

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC AGTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCJ^AGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCT^C CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGGTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAG 2160

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880

GGCTACTACA CCCTCACTGC AGACGAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060

GTAAACATCA CCATCATCTiA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 

GGGGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGGGCAGGG CAACCGGGAC 3240 

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGGC CCACTCCACC 3420 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA AGCCCAATGC TAAGGCCGCT 3540

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660

CCCTCAGTGG AGCTCAGCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380 



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ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTC6ACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980

GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGAGACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580

AAAGAGCTGG OAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
ACTG 



PCT/US02/12476 






Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_000204






MAGPRPSPWA 
CNTQAEIiIiAA 
EVFEPI*ESPV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTIi 
EAFNRIRSNL 
THVCQIiPEDQ 
CSEGWSGQTC 
FQCPRTSGFL 
CHCHQQSLYT 
LKRAEEVWR 
LliPLLALIiLL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKFGA 
RKIHFNWIiPP 
AQGEGPYSSL 
MDDNRPIGPM 
KRPMSIPIIP 
FPGSTNSLHR 
STLTSVSSHD 
EX.HRLNIPNP 
LPGSAFTLST 
RVDGDSPESR 
LFQHPLQSEY 
LSTHMDQQFF 



11 

1 

RLLLAALISV 
GCQRESIWM 
DLYILMDFSN 
KEPWPNSDPP 
WRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIHLKPSF 
NCSTGSIiSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPMGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
MTTTSAT^YG 
SRLTAGVPDT 
AQTSVWEDIi 
PSAPGPLVFT 
LTVPGLSENV 
SSITTTHTSA 
QT 



21 
I 

SLSGTLANRC 
ESSFQITEET 
SMSDDLDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKMP 
SDGLKMDAGI 
QPCIiREGEDK 
QCVCEPGWTG 
IHPGLCEDLR 
TYSYTMEGDG 
ACLALLPCCN 
PGFATHAASI 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLFQP 
IIRDPDELDR 
YWIQGDSESE 
SBPGRLAFNV 
RMLLIENLRE 
EDYDSFLMYS 
THLSPHVPHR 
PTRLVFSALG 
LPNHSYVFRV 
AIiSPDSLQLS 
PYKFKVQART 
TBPFLVDGPT 



31 

I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVI, 
EDVDEFRNKL 
ADGANVLAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEFQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMLSSQ 
7UILLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
VLSTSSTLTR 
PTSLRVSWQE 
RAQSQE6WGR 
WERFRRPNGD 
TEGFGPERE6 
liGAQKLEAGG 



41 

1 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 

QGERISGNLD 
MSRNDERCHL 
PVSSLGVLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 
GEKKGRTCEE 
HKKKDCPPGS 
HYMLRENLMA 
SliRItARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LliELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGEI 
RNGAGWGPER 
QRPSVSDDTE 
DYNSLTRSEH 
PRCERPLQGY 
EREGVITIES 
IVGYLVTCEM 
IITIESQDGG 
SI»TRHVTQEF 



CTDEMFRDRR 



FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QIiRALEHVDG 
NGDFVCGQCV 
YEGQFCEYDN 
NGRGHCECGR 
CNFKVKMVDE 
FWWIjIPIjLIilj 
SDHLDTPMLR 
NLiLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
HLVNGRMDFA 
SHSTTLPRDY 
SVEYQLLNGG 
QVHPQSPLCP 
AQGGGPATAF 
PFPQLGSRAG 
VSRTLTTSGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: BC004372 
Coding sequence: 132.. 2231 



1          11         21         31         41         51

CCTCGTGCCG CGGACCCCAG CCTCTGCCAG GTTCGGTCCG CCATCCTCGT CCCGTCCTCC 60
GCCGGCCCCT GCCCCGCGCC CAGGGATCCT CCAGCTCCTT TCGCCCGCGC CCTCCGTTCG 120
CTCCGGACAC CATGGACAAG TTTTGGTGGC ACGCAGCCTG GGGACTCTGC CTCGTGCCGC 180
TGAGCCTGGC GCAGATCGAT TTGAATATAA CCTGCCGCTT TGCAGGTGTA TTCCACGTGG 240
AGAAAAATGG TCGCTACAGC ATCTCTCGGA CGGAGGCCGC TGACCTCTGC AAGGCTTTCA 300
ATAGCACCTT GCCCACAATG GCCCAGATGG AGAAAGCTCT GAGCATCGGA TTTGAGACCT 360
GCAGGTATGG GTTCATAGAA GGGCATGTGG TGATTCCCCG GATCCACCCC AACTCCATCT 420
GTGCAGCAAA CAACACAGGG GTGTACATCC TCACATCCAA CACCTCCCAG TATGACACAT 480
ATTGCTTCAA TGCTTCAGCT CCACCTGAAG AAGATTGTAC ATCAGTCACA GACCTGCCCA 540
ATGCCTTTGA TGGACCAATT ACCATAACTA TTGTTAACCG TGATGGCACC CGCTATGTCC 600
AGAAAGGAGA ATACAGAACG AATCCTGAAG ACATCTACCC CAGCAACCCT ACTGATGATG 660
ACGTGAGCAG CGGCTCCTCC AGTGAAAGGA GCAGCACTTC AGGAGGTTAC ATCTTTTACA 720
CCTTTTCTAC TGTACACCCC ATCCCAGACG AAGACAGTCC CTGGATCACC GACAGCACAG 780
ACAGAATCCC TGCTACCAGT ACGTCTTCAA ATACCATCTC AGCAGGCTGG GAGCCAAATG 840
AAGAAAATGA AGATGAAAGA GACAGACACC TCAGTTTTTC TGGATCAGGC ATTGATGATG 900
ATGAAGATTT TATCTCCAGC ACCATTTCAA CCACACCACG GGCTTTTGAC CACACAAAAC 960
AGAACCAGGA CTGGACCCAG TGGAACCCAA GCCATTCAAA TCCGGAAGTG CTACTTCAGA 1020
CAACCACAAG GATGACTGAT GTAGACAGAA ATGGCACCAC TGCTTATGAA GGAAACTGGA 1080
ACCCAGAAGC ACACCCTCCC CTCATTCACC ATGAGCATCA TGAGGAAGAA GAGACCCCAC 1140
ATTCTACAAG CACAATCCAG GCAACTCCTA GTAGTACAAC GGAAGAAACA GCTACCCAGA 1200
AGGAACAGTG GTTTGGCAAC AGATGGCATG AGGGATATCG CCAAACACCC AGAGAAGACT 1260






WO 02/086443 

CCCATTCGAC AACAGGGACA GCTGCAGCCT CAGCTCATAC CAGCCATCCA ATGCAAGGAA 1320
GGACAACACC AAGCCCAGAG GACAGTTCCT GGACTGATTT CTTCAACCCA ATCTCACACC 1380
CCATGGGACG AGGTCATCAA GCAGGAAGAA GGATGGATAT GGACTCCAGT CATAGTACAA 1440
CGCTTCAGCC TACTGCAAAT CCAAACACAG GTTTGGTGGA AGATTTGGAC AGGACAGGAC 1500
CTCTTTCAAT GACAACGCAG CAGAGTAATT CTCAGAGCTT CTCTACATCA CATGAAGGCT 1560
TGGAAGAAGA TAAAGACCAT CCAACAACTT CTACTCTGAC ATCAAGCAAT AGGAATGATG 1620
TCACAGGTGG AAGAAGAGAC CCAAATCATT CTGAAGGCTC AACTACTTTA CTGGAAGGTT 1680
ATACCTCTCA TTACCCACAC ACGAAGGAAA GCAGGACCTT CATCCCAGTG ACCTCAGCTA 1740
AGACTGGGTC CTTTGGAGTT ACTGCAGTTA CTGTTGGAGA TTCCAACTCT AATGTCAATC 1800
GTTCCTTATC AGGAGACCAA GACACATTCC ACCCCAGTGG GGGGTCCCAT ACCACTCATG 1860
GATCTGAATC AGATGGACAC TCACATGGGA GTCAAGAAGG TGGAGCAAAC ACAACCTCTG 1920
GTCCTATAAG GACACCCCAA ATTCCAGAAT GGCTGATCAT CTTGGCATCC CTCTTGGCCT 1980
TGGCTTTGAT TCTTGCAGTT TGCATTGCAG TCAACAGTCG AAGAAGGTGT GGGCAGAAGA 2040
AAAAGCTAGT GATCAACAGT GGCAATGGAG CTGTGGAGGA CAGAAAGCCA AGTGGACTCA 2100
ACGGAGAGGC CAGCAAGTCT CAGGAAATGG TGCATTTGGT GAACAAGGAG TCGTCAGAAA 2160
CTCCAGACCA GTTTATGACA GCTGATGAGA CAAGGAACCT GCAGAATGTG GACATGAAGA 2220
TTGGGGTGTA ACACCTACAC CATTATCTTG GAAAGAAACA ACCGTTGGAA ACATAACCAT 2280
TACAGGGAGC TGGGACACTT AACAGATGCA ATGTGCTACT GATTGTTTCA TTGCGAATCT 2340
TTTTTAGCAT A/^TTTTCT ACTCTTAAAA AAAAAAAAAA AAAAAAA

Seq ID NO: 132 Protein sequence:
Protein Accession #: AAH04372



1          11         21         31         41         51

MDKFWWHAAW GLCLVPLSLA QIDLNITCRF AGVFHVEKNG RYSISRTEAA DLCKAFNSTL 60
PTMAQMEKAL SIGFETCRYG FIEGHVVIPR IHPNSICAAN NTGVYILTSN TSQYDTYCFN 120
ASAPPEEDCT SVTDLPNAFD GPITITIVNR DGTRYVQKGE YRTNPEDIYP SNPTDDDVSS 180
GSSSERSSTS GGYIFYTFST VHPIPDEDSP WITDSTDRIP ATSTSSNTIS AGWEPNEENE 240
DERDRHLSFS GSGIDDDEDF ISSTISTTPR AFDHTKQNQD WTQWNPSHSN PEVLLQTTTR 300
MTDVDRNGTT AYEGNWNPEA HPPLIHHEHH EEEETPHSTS TIQATPSSTT EETATQKEQW 360
FGNRWHEGYR QTPREDSHST TGTAAASAHT SHPMQGRTTP SPEDSSWTDF FNPISHPMGR 420
GHQAGRRMDM DSSHSTTLQP TANPNTGLVE DLDRTGPLSM TTQQSNSQSF STSHEGLEED 480
KDHPTTSTLT SSNRNDVTGG RRDPNHSEGS TTLLEGYTSH YPHTKESRTF IPVTSAKTGS 540
FGVTAVTVGD SNSNVNRSLS GDQDTFHPSG GSHTTHGSES DGHSHGSQEG GANTTSGPIR 600
TPQIPEWLII LASLLALALI LAVCIAVNSR RRCGQKKKLV INSGNGAVED RKPSGLNGEA 660
SKSQEMVHLV NKESSETPDQ FMTADETRNL QNVDMKIGV



Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_002882 
Coding sequence: 150-755 



1          11         21         31         41         51
|          |          |          |          |          |
CGAGGTTCGG GTCGTGGGGC GGAGGGAAGA GCGGGCGGGC GGGAGGCGCC GGCGCCAGAC 60
GCGGAGGGAA GGAGCTACGA GTAGCCGCCG AGAGGCCGCG GAGCCAGCGA CGACCGACCC 120
AGCCGAGCCG CCGCCGCCGC CGCGCCCCCA TGGCGGCCGC CAAGGACACT CATGAGGACC 180
ATGATACTTC CACTGAGAAT ACAGACGAGT CCAACCATGA CCCTCAGTTT GAGCCAATAG 240
TTTCTCTTCC TGAGCAAGAA ATTAAAACAC TGGAAGAAGA TGAAGAGGAA CTTTTTAAAA 300
TGCGGGCAAA ACTGTTCCGA TTTGCCTCTG AGAACGATCT CCCAGAATGG AAGGAGCGAG 360
GCACTGGTGA CGTCAAGCTC CTGAAGCACA AGGAGAAAGG GGCCATCCGC CTCCTCATGC 420
GGAGGGACAA GACCCTGAAG ATCTGTGCCA ACCACTACAT CACGCCGATG ATGGAGCTGA 480
AGCCCAACGC AGGTAGCGAC CGTGCCTGGG TCTGGAACAC CCACGCTGAC TTCGCCGACG 540
AGTGCCCCAA GCCAGAGCTG CTGGCCATCC GCTTCCTGAA TGCTGAGAAT GCACAGAAAT 600
TCAAAACAAA GTTTGAAGAA TGCAGGAAAG AGATCGAAGA GAGAGAAAAG AAAGCAGGAT 660
CAGGCAAAAA TGATCATGCC GAAAAAGTGG CGGAAAAGCT AGAAGCTCTC TCGGTGAAGG 720
AGGAGACCAA GGAGGATGCT GAGGAGAAGC AATAAATOGT CTTATTTTAT TTTCTTTTCC 780
TCTCTTTCCT TTCCTTTTTT TAAAAAATTT TACCCTGCCC CTCTTTTTCG GTTTGTTTTT 840
ATTCTTTCAT TTTTACAAGG GACGTTATAT AAAGAACTGA ACTC

Seq ID NO: 134 Protein sequence:
Protein Accession #: NP_002873



1          11         21         31         41         51
|          |          |          |          |          |
MAAAKDTHED HDTSTENTDE SNHDPQFEPI VSLPEQEIKT LEEDEEELFK MRAKLFRFAS 60
ENDLPEWKER GTGDVKLLKH KEKGAIRLLM RRDKTLKICA NHYITPMMEL KPNAGSDRAW 120
VWNTHADFAD ECPKPELLAI RFLNAENAQK FKTKFEECRK EIEEREKKAG SGKNDHAEKV 180
AEKLEALSVK EETKEDAEEK Q

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_000077.2 

Coding sequence: 277-742 



CCCAACCTGG 
TCCTCCGAGC 
GGATTTGAGG 
GGGCTGGCTG 
GGAGAGCAGG 
GCCTTCGGCT 
GCTGCTGGAG 
GGTCATGATG 



11 
1 

GGCGACTTCA 
ACTCGCTCAC 
GACAGGGTCG 
GTCACCAGAG 
CAGCGGGCG6 
GACTGGCTGG 
GCGGGGGCGC 
ATGGGCAGCG 



21 

I 

GGTGTGCCAC 
GGCGTCCCCT 
GAGGGGGCTC 
GGTGGGGCGG 
CGGGGAGCAG 
CCACGGCCGC 
TGCCCAACGC 
CCC6AGTGGC 



31 

1 

ATTCGCTAAG 
TGCCTGGAAA 
TTCCGCCAGC 
ACCGCGTGCG 
CATG6AGCCG 
GGCCCGGGGT 
ACCGAATAGT 
GGAGCTGCTG 



41 
I 

TGCTCGGAGT 
GATACCGCGG 
ACCGGAGGAA 
CTCGGCGGCT 
GCGGCGGG6A 
CGGGTAGAGG 
TACGGTCGGA 
CTGCTCCACG 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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51 
I 

TAATAGCACC 
TCCCTCCAGA 
GAAAGAGGAG 
GCGGAGAGGG 
GCAGCATGGA 
AGGT6CGGGC 
GGCCGATCCA 
GCGCGGAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 




CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 540
CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600
GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660
GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720
TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780
GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840
TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCCTGCCTTT TAACGTAGAT 900
ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960
AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020
AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080
GAAGCTGTCG ACTTCATGAC AAGGATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140
CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200
ATTTTCATTC ATTCACTC

Seq ID NO: 136 Protein sequence:
Protein Accession #: NP_000068.1
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20 



1          11         21         31         41         51
|          |          |          |          |          |
MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60
ELLLLHGAEP NCADPATLTR PVHDAAREGF LDTLVVLHRA GARLDVRDAW GRLPVDLAEE 120
LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD



Seq ID NO: 137 DNA sequence

Nucleic Acid Accession #: NM_058196.1 
Coding sequence: 104-421 



1          11         21         31         41         51
|          |          |          |          |          |
TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60
GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120
CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180
TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240
GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300
GGCTGAGGAG CTGGGCCATC GCGATGTCGC ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360
CAGAGGCAGT AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420
AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 480
CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540
AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600
TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660
CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720
CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 780
AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC


Seq ID NO: 138 Protein sequence: 
Protein Accession #: NP_478103.1

1          11         21         31         41         51

MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLVVLHRAG ARLDVRDAWG 60
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD






Seq ID NO: 139 DNA sequence

Nucleic Acid Accession #: NM_058197.1 

Coding sequence: 272-684 



1          11         21         31         41         51
|          |          |          |          |          |
CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60
TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180
GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300
GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360
GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420
TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480
CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540
CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600
TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660
ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720
TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780
ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840
TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900
GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960
GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020
CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080
CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140
CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200
TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260
TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320
ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380
GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440
CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1500




TTCATTCATT CACTC 

Seq ID NO: 140 Protein sequence:
Protein Accession #: NP_478104.1

1          11         21         31         41         51
|          |          |          |          |          |
MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SYGRRPIQVG 60
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGDALGAWET 120






Seq ID NO: 141 DNA sequence 

Nucleic Acid Accession #: NM_058195.1 

Coding sequence: 163-684 



1          11         21         31         41         51
|          |          |          |          |          |
CCTCCCTACG GGCGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60
GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120
GGGCGCGCTC AGGGAAGGCG GGTGOGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180
GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGCGCT GCTCACCTCT GGTGCCAAAG 240
GGCGGCGCAG CGGCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GCGCAGGTTC 300
TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCCGCCGC GAGTGAGGGT TTTCGTGGTT 360
CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG GGCCCGCCGC TGTGGCCCTC 420
GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTTCCTAG AAGACCAGGT 480
CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCACGGCG CGGAGCCCAA 540
CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600
GGACACGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGG 660
CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720
CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780
CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840
CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900
CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960
TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020
TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080
ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140
GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200
CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260
TTCATTCATT CACTC



Seq ID NO: 142 Protein sequence: 
Protein Accession #: NP_478102.1






1          11         21         31         41         51
|          |          |          |          |          |
MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 60
VRVFVVHIPR LTGEWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG

Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_018131 
Coding sequence: 412..1107



60 
120 



1          11         21         31         41         51
|          |          |          |          |          |
GAAATTGCAC ACTTAAAGAC ATCAGTGGAT GAAATCACAA GTGGGAAAGG AAAGCTGACT 60
GATAAAGAGA GACAGAGACT TTTGGAGAAA ATTCGAGTCC TTGAGGCTGA GAAGGAGAAG 120
AATGCTTATC AACTCACAGA GAAGGACAAA GAAATACAGC GACTGAGAGA CCAACTGAAG 180
GCCAGATATA GTACTACCGC ATTGCTTGAA CAGCTGGAAG AGACAACGAG AGAAGGAGAA 240
AGGAGGGAGC AGGTGTTGAA AGCCTTATCT GAAGAGAAAG ACGTATTGAA ACAACAGTTG 300
TCTGCTGCAA CCTCACGAAT TGCTGAACTT GAAAGCAAAA CCAATACACT CCGTTTATCA 360
CAGACTGTGG CTCCAAACTG CTTCAACTCA TCAATAAATA ATATTCATGA AATGGAAATA 420
CAGCTGAAAG ATGCTCTGGA GAAAAATCAG CAGTGGCTCG TGTATGATCA GCAGCGGGAA 480
GTCTATGTAA AAGGACTTTT AGCAAAGATC TTTGAGTTGG AAAAGAAAAC GGAAACAGCT 540
GCTCATTCAC TCCCACAGCA GACAAAAAAG CCTGAATCAG AAGGTTATCT TCAAGAAGAG 600
AAGCAGAAAT GTTACAACGA TCTCTTGGCA AGTGCAAAAA AAGATCTTGA GGTTGAACGA 660
CAAACCATAA CTCAGCTGAG TTTTGAACTG AGTGAATTTC GAAGAAAATA TGAAGAAACC 720
CAAAAAGAAG TTCACAATTT AAATCAGCTG TTGTATTCAC AAAGAAGGGC AGATGTGCAA 780
CATCTGGAAG ATGATAGGCA TAAAACAGAG AAGATACAAA AACTCAGGGA AGAGAATGAT 840
ATTGCTAGGG GAAAACTTGA AGAAGAGAAG AAGAGATCCG AAGAGCTCTT ATCTCAGGTC 900
CAGTCTCTTT ACACATCTCT GCTAAAGCAG CAAGAAGAAC AAACAAGGGT AGCTCTGTTG 960
GAACAACAGA TGCAGGCATG TACTTTAGAC TTTGAAAATG AAAAACTCGA CCGTCAACAT 1020
GTGCAGCATC AATTGCATGT AATTCTTAAG GAGCTCCGAA AAGCAAGAAA AAATAACACA 1080
GTTGGAATCC TTGAAACAGC TTCATGAGTT TGCCATCACA GAGCCATTAG TCACTTTCCA 1140
AGGAGAGACT GAAAACAGAG AAAAAGTTGC CGCCTCACCA AAAAGTCCCA CTGCTGCACT 1200
CAATGGAAGC CTGGTGGAAT GTCCCAAGTG CAATATACAG TATCCAGCCA CTGAGCATCG 1260
CGATCTGCTT GTCCATGTGG AATACTGTTC J\7ikAGTAGC7U\ AATAAGTATT TGTTTTGATA 1320
TTAAAAGATT CAATACTGTA TTTTCTGTTA GCTTGTGGGC ATTTTGAATT ATATATTTCA 1380
CATTTTGCAT AAAACTGCCT ATCTACCTTT GACACTCCAG CATGCTAGTG AATCATGTAT 1440
CTTTTAGGCT GCTGTGCATT TCTCTTGGCA GTGATACCTC CCTGACATGG TTCATCATCA 1500
GGCTGCAATG ACAGAATGTG GTGAGCAGCG TCTACTGAGA TACTAACATT TTGCACTGTC 1560
AAAATACTTG GTGAGGAAAA GATAGCTCAG GTTATTGCTA ATGGGTTAAT GCACCAGCAA 1620
GCAAAATATT TTATGTTTCG GGGGTTTTGA AAAATCAAAG ATAATTAACC AAGGATCTTA 1680
ACTGTGTTCG CATTTTTTAT CCAAGCACTT AGAAAACCTA CAATCCTAAT TTTGATGTCC 1740
ATTGTTAAGA GGTGGTGATA GATACTATTT TTTTTTCATA TTGTATAGCG GTTATTAGAA 1800




AAGTTGGGGA TTTTCTTGAT CTTTATTGCT GCTTACCATT GAAACTTAAC CCAGCTGTGT 1860 

TCCCCAACTC TGTTCTGCGC ACGRARCAGT RTCTGTTTGA GGCATAATCT TAAGTGGCCA 1920 

CACACAATGT TTTCTCTTAT GTTATCTGGC AGTAACTGTA ACTTGAATTA CATTAGCACA 1980 

TTCTGCTTAG CTAAAATTGT TAAAATAAAC TTTAATAAAC CCATGTAGCC CTCTCATTTG 2040 

ATTGACAGTA TTTTAGTTAT TTTTGGCATT CTTAAAGCTG GGCAATGTAA TGATCAGATC 2100 

TTTGTTTGTC TGAACAGGTA TTTTTATACA TGCTTTTTGT AAACC3VAAAA CTTTTAAATT 2160 

TCTTCAGGTT TTCTAACATG CTTACC3VCTG GGCTACTGTA AATGAGAAAA GAA1AAAATT 2220
ATTTAATGTT TT 



Seq ID NO: 144 Protein sequence:
Protein Accession #: NP_060601






1          11         21         31         41         51
|          |          |          |          |          |
MEIQLKDALE KNQQWLVYDQ QREVYVKGLL AKIFELEKKT ETAAHSLPQQ TKKPESEGYL 60
QEEKQKCYND LLASAKKDLE VERQTITQLS FELSEFRRKY EETQKEVHNL NQLLYSQRRA 120
DVQHLEDDRH KTEKIQKLRE ENDIARGKLE EEKKRSEELL SQVQSLYTSL LKQQEEQTRV 180
ALLEQQMQAC TLDFENEKLD RQHVQHQLHV ILKELRKARK NNTVGIDETA S

Seq ID NO: 145 DNA sequence 
Nucleic Acid Accession #: NM_001168 
Coding sequence: 50.. 478 



60 
120 
180 



CCGCCAGATT 
GACGTTGCCC 
CTGGCCCTTC 
CCACTGCCCC 
GGAAGGCTGG 
CGCTTTCCTT 
GGACAGAGAA 
TGAGGAAACT 
CCTCTGGCCG 
GTGCCACCAG 
CAAATTAGAT 
TGCCTGTGCA 
GGGGGCTCAT 
AAGGCAGTGT 
GTGAATGTGT 
GGTGCCTGTT 
ACAGTTTTTT 
GTGATGAGAG 
TATTTTGTTT 
AGCCATTCTA 
AGTGATAGGA 
AGTGAGCCGC 
CTTTTTAAAT 
TCTGTCAGCC 
CCAGGTCCCC 
GATGGATTTG 
GCT6GAAACC 



11 

I 

TGAATCGCGG 
CCTGCCTGGC 
TTGGAGGGCT 
ACTGAGAACG 
GAGCCAGATG 
TCTGTCAAGA 
AGAGCCAAGA 
GCGAAGAAAG 
GAGCTGCCTG 
CCTTCCTGTG 
GTTTCAACTG 
GCGGGTGCTG 
TTTTGCTGTT 
CCCTTTTGCT 
CfTGGACCTCA 
GAATCTGAGC- 
TGTTGTTGTG 
AATGGAGACA 
GAATTGTTAA 
AGTCATTGGG 
AGCGTCTGGC 
GGGGCACATG 
GACTTGGCTC 
CAACCTTCAC 
GCTTTCTTTG 
ATTCGCCCTC 
TCTGGAGGTC 



21 
I 

GACCCGTTGG 
AGCCCTTTCT 
GCGCCTGCAC 
AGCCAGACTT 
ACGACCCCAT 
AGCAGTTTGA 
ACAAAATTGC 
TGCGCCGTGC 
GTCCCAGAGT 
GGCCCCTTAG 
TGCTCCTGTT 
CTGGTAACAG 
TTGATTCCCG 
AGAGCTGACA 
TGTTGTTGAG 
TGCAQGTTCC 
TTTTTTTGTT 
GAGTCCCTGG 
TTCACAGAAT 
GAAACGGGGT 
AGATACTCCT 
CTGGCCGCTC 
GATGCTGTGG 
ATCTGTCACG 
GAGGCAGCAG 
CTCCCTGTCA 
ATCTCGGCTG 



31 

1 

CAGAGGTGGC 
CAAGGACCAC 
CCCGGAGCGG 
GGCCCAGTGT 
AGAGGAACAT 
AGAATTAACC 
AAAGGAAACC 
CATCGAGCAG 
GGCTGCACCA 
CAATGTCTTA 
TTGTCTTGAA 
TGGCTGCTTC 
GGCTTACCAG 
GCTTTGTTCG 
GCTGTCACAG 
TTATCTGTCA 

CTCCTCTACT 
AGCACAAACT 
GAACTTCAGG 
TTTGCCACTG 
CTCCCTCAGA 
GGGACTGGCT 
TTCTCCACAC 
CTCCCGCAGG 
TAGAGCTGCA 
TTCCTGAGAA 



41 

I 

GGCGGCGGCA 
CGCATCTCTA 
ATGGCCGAGG 
TTCTTCTGCT 
AAAAAGCATT 
CTTGGTGAAT 
AACAATAAGA 
CTGGCTGCCA 
CTTCCAGG6T 
GGAAAGGAGA 
AGTGGCACCA 
TCTCTCTCTC 
GTGAGAAGTG 
CGTGGGCAGA 
TCCTGAGTGT 
CACCTGTGCC 
GGTAGATGCA 
GTTTAACAAC 
ACAATTAAAA 
TGGATGAGGA 
CTGTGTGATT 
AAAAGGCAGT 
GGGCTGCTGC 
GGGGGAGAGA 
GCTGAAGTCT 
GGGTGGATTG 
ATAAAAAGCC 



51 

I 

TGGGTGCCCC 
CATTCAAGAA 
CTGGCTTCAT 
TCAAGGAGCT 
CGTCCGGTTG 
TTTTGAAACT 
AGAAAGAATT 
TGGATTGAGG 
TTATTCCCTG 
TCAACATTTT 
GAGGTGCTTC 
TCTCTTTTTT 
AGGGAGGAAG 
GCCTTCCACA 
GGACTTGGCA 
TCCTCAGAGG 
TGACTTGTGT 
ATGGCTTTCT 
CTAAGCACAA 
GACAGAATAG 
AGACAGGCCC 
GGCCTAAATC 
AGGCCGTGTG 
CGCAGTCCGC 
GGCGTAAGAT 
TTACAGCTTC 
TGTCATTTC 



Seq ID NO: 146 Protein sequence:
Protein Accession #: NP_001159



1          11         21         31         41         51
|          |          |          |          |          |
MGAPTLPPAW QPFLKDHRIS TFKNWPFLEG CACTPERMAE AGFIHCPTEN EPDLAQCFFC 60
FKELEGWEPD DDPIEEHKKH SSGCAFLSVK KQFEELTLGE FLKLDRERAK NKIAKETNNK 120
KKEFEETAKK VRRAIEQLAA MD

Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: NM_014176.1

Coding sequence: 127-720 



1          11         21         31         41         51
|          |          |          |          |          |
GCGCGCAGCG CTGGTACCCC GTTGGTCCGC GCGTTGCTGC GTTGTGAGGG GTGTCAGCTC 60
AGTGCATCCC AGGCAGCTCT TAGTGTGGAG CAGTGAACTG TGTGTGGTTC CTTCTACTTG 120
GGGATCATGC AGAGAGCTTC ACGTCTGAAG AGAGAGCTGC ACATGTTAGC CACAGAGCCA 180
CCCCCAGGCA TCACATGTTG GCAAGATAAA GACCAAATGG ATGACCTGCG AGCTCAAATA 240
TTAGGTGGAG CCAACACACC TTATGAGAAA GGTGTTTTTA AGCTAGAAGT TATCATTCCT 300
GAGAGGTACC CATTTGAACC TCCTCAGATC CGATTTCTCA CTCCAATTTA TCATCCAAAC 360
ATTGATTCTG CTGGAAGGAT TTGTCTGGAT GTTCTCAAAT TGCCACCAAA AGGTGCTTGG 420
AGACCATCCC TCAACATCGC AACTGTGTTG ACCTCTATTC AGCTGCTCAT GTCAGAACCC 480
AACCCTGATG ACCCGCTCAT GGCTGACATA TCCTCAGAAT TTAAATATAA TAAGCCAGCC 540
TTCCTCAAGA ATGCCAGACA GTGGACAGAG AAGCATGCAA GACAGAAACA AAAGGCTGAT 600
GAGGAAGAGA TGCTTGATAA TCTACCAGAG GCTGGTGACT CCAGAGTACA CAACTCAACA 660
CAGAAAAGGA AGGCCAGTCA GCTAGTAGGC ATAGAAAAGA AATTTCATCC TGATGTTTAG 720
GGGACTTGTC CTGGTTCATC TTAGTTAATG TGTTCTTTGC CAAGGTGATC TAAGTTGCCT 780
ACCTTGAATX TTTTTTTAAA TATATTTGAT GACATAATTT TTGTGTAGTT TATTTATCTT 840
GTACATATGT ATTTTGAAAT CTTTTAAACC TGAAAAATAA ATAGTCATTT AATGTTGAAA 900



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 




AAAAAAAAAA AAAAAAAAAA AAAAAAAA






Seq ID NO: 148 Protein sequence: 
Protein Accession #: NP_054895.1

1 11 21 31 41 51 

|          |          |          |          |          |

MQRASRLKRE LHMLATEPPP GITCWQDKDQ MDDLRAQILG GANTPYEKGV FKLEVIIPER 60
YPFEPPQIRF LTPIYHPNID SAQRICLDVL KLPPKGAWRP SLNIATVLTS IQLLMSEPNP 120
DDPLMADISS EFKYNKPAFL KNARQWTEKH ARQKQKADEE EMLDNLPEAG DSRVHNSTQK 180

RKASQLVGIE KKFHPDV 

Seq ID NO: 149 DNA sequence
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224-2722

1 11 21 31 41 51 

I I I I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60
GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120
CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 
GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 
CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300
ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 
GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 
GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 
AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 
AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACGAG CAAAAACATA ATAAGGCTGT 660 
CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780
ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840
AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 
CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260

CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 

AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACATiAAC 2820

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940

GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000
ACGiU^GGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 






Seq ID NO: 150 Protein sequence: 
Protein Accession #: NP_003803



1          11         21         31         41         51

|          |          |          |          |          |

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60

RPRAWGAAAP SAPHWNETAE KNLGVLADED NTLQQNSSSN ISYSNAMQKE ITLPSRLIYY 120 

INQDSESPYH VLDTKARHQQ KHNKAVHLAQ ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQYS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTFVYM IEPLELVHDE 240

KSTGRPHIIQ KTLAGQYSKQ MKNLTMERGD QWPFLSELQW LKRRKRAVNP SRGIFEEMKY 300

LELMIVNDHK TYKKHRSSHA HTNNFAKSVV NLVDSIYKEQ LNTRVVLVAV ETWTEKDQID 360

ITTNPVQMLH EFSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420

LPMAVAQVLS QSLAQNLGIQ WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480

RDFLQRGGGA CLFNRPTKLF EPTECGNGYV EAGEECDCGF HVECYGLCCK KCSLSNGAHC 540

SDGPCCNNTS CLFQPRGYEC RDAVNECDIT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDODRWIQC SKHDVPCGFL 660 
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LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHWIiD DDTDVGYVED GTPCXSPSMMC 
LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 
KDEGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI 



720 
780 



PCT/US02/12476 



Seq ID NO: 151 DNA sequence 
Nucleic Acid Accession #: NM_023915 
Coding sequence: 250-1326 






1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 
GTGAATGGAC AGCCAGCCAC CACAATGA7A GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 
GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 
ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 
AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 
AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 
ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 
TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 
ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320 
GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380 
TTCATTATCC TTAAAAAAAA AA 



Seq ID NO: 152 Protein sequence: 
Protein Accession #: NP_076404 



1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60 
GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 
FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 
NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 
ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 



Seq ID NO: 153 DNA sequence 
Nucleic Acid Accession #: D80008.1 
Coding sequence: 149-739 



1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 
CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 
AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 
CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 
AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 
GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360 
AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420 
ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 
GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540 
TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGTCCG 600 
GTGTCTAAAA GACTATGGAG AATTTGAAGT TGATGATGGC ACTTCAGTCC TATTAAAAAA 660 
AAATAGCCAG CACTTTTTAC CTCGATGGAA ATGTGAGCAG CTGATCAGAC AAGGAGTCCT 720 
GGAGCACATC CTGTCATGAC CATGCGCCGA GGCACTTCCA GGCTTCACTC AACTCATGGA 780 
CTCCTCTGTA CTCACTCTCT CCACCACTCC CTTCACCTCC CTCTTTGATT TTAGAAGCTA 840 
TAGACATTGT TTAAGATAAC TAAGAATACT TGGCTAAGAA GTATAATTTG CTAACTATTA 900 
AGGACTTTCT TTTTTTAATG TTGTACACTA TTCTTCCTAC TCTTTTTTGG TTTTGGTTTT 960 
GTTTTGTAGA GACTGTCTCA CTATGTTGCC CAAGCTGGTC TCAAACTCCT GGCCTCAAGC 1020 
AGTCCTCCCA CCTTAGCTTC TCAAAGTGTT GAGATCACAG GCGTGAGCCA CTGCACCCGG 1080 
CCCCTACTCC TTTTTCTAAT AAGCTGTATC TGTAATCACA GCATTCCTAC AGTTGTTACA 1140 
GTGTGTTTTT TAAATGAAAG TAAACATGGT TACATTTGAA TCTCTTAAAT AAGCAGTCAC 1200 
TTGGCTGGAC AGGAAGAAGG TAGATCCTGT GTGTCTTGTT TTCTGGTCAT GTGTATTGTA 1260 
CAAGCTAGAG AGCTGAATTT CTGAGATACA CATTTTCAAA TCACATGCAA GTGAAGATGA 1320 
TGGTCTGTAG AAATTTTCAG TATATATAAT GTTTAATGAC ATACTAATTT ATCATCTGGC 1380 
TATTTGGGAA GGAAGGACAC ACATGGATTT TGCACATTTC CACCATGGTG GCTGGTGTGG 1440 
CTTGTGGCTA TGGGGTGATC ACCAGTATCA CCACTTTGGA AGGGGACAGT GAAATTGGGG 1500 
CTAGAGAAGG AACTTTGTAC AGTTTTCCCT GAGATTCAGA TTGACTGAAA AGTCACATGA 1560 
AGAGTTGATT GTCTTTTAAT GGTATGTTTT AAACAGCTGA CATTTTAAAT TTTGATGAAA 1620 
TCCAGTTTAT TCGTTTGTTC TTTTATGCTT TGGGTGTTGC ATCCGAGAAA TCTTTTCCCA 1680 
TCCCAAGATC ACAATTTTTT TTCCTTTTTA CTTCTAGAAG TGTTATAATT TTAAGCTTTA 1740 
TACTTTGGTC TATGACCCGT TTTTTTTTTT GTTTTGTTTT GTTTTTTCGT TTGTTTCTTT 1800 
GTTTTGAGAT GGAGTCTTGT TCTGTCACCC AGGCTGGGGT GCAGTGGCGT GATCTTGGCT 1860 
CACTGCAATC TCTATCCCCT GGGTTCAAGT GATTCTCTTG TCTCAGCCTC CCAAGTAGCT 1920 
GGGATTACAG GCACAGGCCG CCACGCCTGG CTAATTTTTG TATTTTTAGT AGAGACAGAG 1980 



TTTTACCATG TTGGCCAGGC TGGTTTCAAA CTCCTGACCT CAAGTGACCC ACCTTGGCCT 2040 

CCCAAAGTTT TGGGATTACA AGTGTGGGCC ACCGCGGCCA GCCTATGATC CATTTTGAAT 2100 

GAATTTTTTA TATGGTGCAA GGTGTCAATC CACCTTCACT TTTTCTTGGG AATATAGATA 2160 

TCCAGCTGTT TCACTACCAT TTTTTGAAAG GACTGCCCTT TGCTCTATCA CCTTTGCATT 2220 

TTTGTTAAAA AGTAGTTGTC AATGTATATG TGGGTTTATT TCAGGACTCT GTTTTGTTCC 2280 

ATTGACCTGT TTTTCTCTCC TGAATGCCAA TACCATATTT GTATGTAGTG TATGTAATTT 2340 

TCTAATAATT CTTGAAACAG ATAGTATTAA TGTGTCATAT TTTTGCTGTT GTTTGTATTT 2400 

TTTGTAGAGA TGGGGTTTCA CCGTGTTGGC CAGGCTGTGT TGAACTCCTG AGCTAAAGCA 2460 

ATACACTTGC CTCGTCCTCC CCATGTGCTG GGATTACAGG CGTGAGCCTT GGTGCTGGCC 2520 

CAGTGTACCA CATTTCTTTT TGAGATTTGT TTTGGCTATG TTAAGTCCTT TGCTTTTGAT 2580 

GTGAAATTTG GGAACAGGCA GGGTGTGGTG GCTTATGCCT GTAATCCTAG AACTTTGGGA 2640 

GGCCTAGATG GGTGGATCAC TTGAGCTCAG GAGTTCCAGA CCAGCCCGGG CCTATGGCAA 2700 

AACTCCGTCT CTACAAAAAA TAGAAAAAAT TAGCCAGGTG TGGTGGTGCA TGCCTGTAGT 2760 

CACAGTTACA CGGCAGGCTG AGGTGGGAGG ATCACTTGAA CCCCAGAGGT CAAGACTGCA 2820 

GTGAGCTGAG ATCACACCAC TGTACTCCAG CCTGGGTGAC AAAGTGAGAC TCTATCTCAA 2880 

AAAGAAATTA GGATCAATTT GTCAATTTCT ACAACAACAA CAACAAAAAC CCCTGTTGGG 2940 

CACCTTGATT GAGATTGCAT TGAATTTATA TAAAACTGTT GGGAGAATTG ACATCTTAAT 3000 

AATATTGAGT CTTCTGGCCT ATAAACAAGG TCTGTCTTCC TAGGTATTAA TGTTTTGTCT 3060 

TCTATTTCTC TTAATAATCT TTTGTAGTTT TCAGTGTACA GGTCTACCAT GTCAGCATTT 3120 

CATAGTTTTG ATGCTAAATG GTATTTTAAA ATTTCAAATT CTAACCACTT GTTGCTAGTA 3180 

AATAGAAATA CAATTGATGT TGAACTTGTA TCCTTCAGCC TTGCTAAACT GTGAGTTCTC 3240 

ATGGTGTTTT TGTAAATTAC ATCAACAGTC ATGTGTTCTA TGAATAAAGA GTTTTACTCC 3300 
TTC 

Seq ID NO: 154 Protein sequence: 
Protein Accession #: BAA11503.1 

1 11 21 31 

I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN 
LATYMRSLGG DEGLDITQDM KPPKSLYIEV RCLKDYGEFE 
KCEQLIRQGV LEHILS 

Seq ID NO: 155 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 149-709 

1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 

AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 

GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360 

AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420 

ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 

GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540 

TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGCTGG 600 

ATGCAGTGGC GCGATCTCGG CTCAACCTGC AACCTCCACC TCCCAGGTTC ACCTCAACTG 660 

CAACCTCCAC CTCCCAGGTC CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG 720 

GCACTTCAGT CCTATTAAAA AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC 780 

AGCTGATCAG ACAAGGAGTC CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC 840 

CAGGCTTCAC TCAACTCATG GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT 900 

CCCTCTTTGA TTTTAGAAGC TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG 960 

AAGTATAATT TGCTAACTAT TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT 1020 

ACTCTTTTTT GGTTTTGGTT TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG 1080 

TCTCAAACTC CTGGCCTCAA GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC 1140 

AGGCGTGAGC CACTGCACCC GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA 1200 

CAGCATTCCT ACAGTTGTTA CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG 1260 

AATCTCTTAA ATAAGCAGTC ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG 1320 

TTTTCTGGTC ATGTGTATTG TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA 1380 

AATCACATGC AAGTGAAGAT GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG 1440 

ACATACTAAT TTATCATCTG GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT 1500 

TCCACCATGG TGGCTGGTGT GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG 1560 

GAAGGGGACA GTGAAATTGG GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA 1620 

GATTGACTGA AAAGTCACAT GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT 1680 

GACATTTTAA ATTTTGATGA AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT 1740 

GCATCCGAGA AATCTTTTCC CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA 1800 

AGTGTTATAA TTTTAAGCTT TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT 1860 

TTGTTTTTTC GTTTGTTTCT TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG 1920 

GTGCAGTGGC GTGATCTTGG CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT 1980 

TGTCTCAGCC TCCCAAGTAG CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT 2040 

TGTATTTTTA GTAGAGACAG AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC 2100 

CTCAAGTGAC CCACCTTGGC CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC 2160 

CAGCCTATGA TCCATTTTGA ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA 2220 

CTTTTTCTTG GGAATATAGA TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC 2280 

TTTGCTCTAT CACCTTTGCA TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA 2340 

TTTCAGGACT CTGTTTTGTT CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT 2400 

TTGTATGTAG TGTATGTAAT TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT 2460 

ATTTTTGCTG TTGTTTGTAT TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT 2520 

GTTGAACTCC TGAGCTAAAG CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA 2580 

GGCGTGAGCC TTGGTGCTGG CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA 2640 

TGTTAAGTCC TTTGCTTTTG ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC 2700 

CTGTAATCCT AGAACTTTGG GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA 2760 

GACCAGCCCG GGCCTATGGC AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG 2820 



41 51 
I I 

EQNQSDVNEA KSGGRSDLIP 60 
ALRFHMAAEE MEWFNNYKRS 120 
VDDGTSVLLK KNSQHFLPRW 180 
TGTGGTGGTG CATGCCTGTA GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG 2880 

AACCCCAGAG GTCAAGACTG CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG 2940 

ACAAAGTGAG ACTCTATCTC AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC 3000 

AACAACAAAA ACCCCTGTTG GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG 3060 

TTGGGAGAAT TGACATCTTA ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT 3120 

CCTAGGTATT AATGTTTTGT CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA 3180 

CAGGTCTACC ATGTCAGCAT TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA 3240 

TTCTAACCAC TTGTTGCTAG TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG 3300 

CCTTGCTAAA CTGTGAGTTC TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC 3360 
TATGAATAAA GAGTTTTACT CCTTC 

Seq ID NO: 156 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 

TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 120 

LATYMRSLGG DEGLDITQDM KPPKSLYIEA GCSGAISAQP ATSTSQVHLN CNLHLPGPVS 180 
KRLWRI 

Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-621 

1 11 21 31 41 51 

I I I I I I 

TTCGGCGCCA AAGCGCGGAG CGGAGGCCGA GGCGAGAGCC TGGCGCTGTA GGACTAGAAC 60 

GAAAGGAGTG AGGCGCCGAG AGCCCAGATA CCATTTTGGC GTGAGAGCTG GTGGTTGGCA 120 

AGGCCGCGGG AGTGGGAAGC GTCCGCCATG TTCTGCGAAA AAGCCATGGA ACTGATCCGC 180 

GAGCTGCATC GCGCGCCCGA AGGGCAACTG CCTGCCTTCA ACGAGGATGG ACTCAGACAA 240 

GTTCTGGAGG AGATGAAAGC TTTGTATGAA CAAAACCAGT CTGATGTGAA TGAAGCAAAG 300 

TCAGGTGGAC GAAGTGATTT GATACCAACT ATCAAATTTC GACACTGTTC TCTGTTAAGA 360 

AATCGACGCT GCACTGTAGC ATACCTGTAT GACCGCTTGC TTCGGATCAG AGCACTCAGA 420 

TGGGAATATG GTAGCGTCTT GCCAAATGCA TTACGATTTC ACATGGCTGC TGAAGAAGTC 480 

CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG GCACTTCAGT CCTATTAAAA 540 

AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC AGCTGATCAG ACAAGGAGTC 600 

CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC CAGGCTTCAC TCAACTCATG 660 

GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT CCCTCTTTGA TTTTAGAAGC 720 

TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG AAGTATAATT TGCTAACTAT 780 

TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT ACTCTTTTTT GGTTTTGGTT 840 

TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG TCTCAAACTC CTGGCCTCAA 900 

GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC AGGCGTGAGC CACTGCACCC 960 

GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA CAGCATTCCT ACAGTTGTTA 1020 

CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG AATCTCTTAA ATAAGCAGTC 1080 

ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG TTTTCTGGTC ATGTGTATTG 1140 

TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA AATCACATGC AAGTGAAGAT 1200 

GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG ACATACTAAT TTATCATCTG 1260 

GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT TCCACCATGG TGGCTGGTGT 1320 

GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG GAAGGGGACA GTGAAATTGG 1380 

GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA GATTGACTGA AAAGTCACAT 1440 

GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT GACATTTTAA ATTTTGATGA 1500 

AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT GCATCCGAGA AATCTTTTCC 1560 

CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA AGTGTTATAA TTTTAAGCTT 1620 

TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT TTGTTTTTTC GTTTGTTTCT 1680 

TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG GTGCAGTGGC GTGATCTTGG 1740 

CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT TGTCTCAGCC TCCCAAGTAG 1800 

CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT TGTATTTTTA GTAGAGACAG 1860 

AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC CTCAAGTGAC CCACCTTGGC 1920 

CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC CAGCCTATGA TCCATTTTGA 1980 

ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA CTTTTTCTTG GGAATATAGA 2040 

TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC TTTGCTCTAT CACCTTTGCA 2100 

TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA TTTCAGGACT CTGTTTTGTT 2160 

CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT TTGTATGTAG TGTATGTAAT 2220 

TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT ATTTTTGCTG TTGTTTGTAT 2280 

TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT GTTGAACTCC TGAGCTAAAG 2340 

CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA GGCGTGAGCC TTGGTGCTGG 2400 

CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA TGTTAAGTCC TTTGCTTTTG 2460 

ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC CTGTAATCCT AGAACTTTGG 2520 

GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA GACCAGCCCG GGCCTATGGC 2580 

AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG TGTGGTGGTG CATGCCTGTA 2640 

GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG AACCCCAGAG GTCAAGACTG 2700 

CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG ACAAAGTGAG ACTCTATCTC 2760 

AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC AACAACAAAA ACCCCTGTTG 2820 

GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG TTGGGAGAAT TGACATCTTA 2880 

ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT CCTAGGTATT AATGTTTTGT 2940 

CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA CAGGTCTACC ATGTCAGCAT 3000 

TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA TTCTAACCAC TTGTTGCTAG 3060 

TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG CCTTGCTAAA CTGTGAGTTC 3120 

TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC TATGAATAAA GAGTTTTACT 3180 
CCTTC 



Seq ID NO: 158 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE VRCLKDYGEF 120 
EVDDGTSVLL KKNSQHFLPR WKCEQLIRQG VLEHILS 

Seq ID NO: 159 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-229 






1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 
CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 
AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 
CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACAATTAGC TGGGTGTGGT 240 
GGCACACACC TGTAGTCCCA GCAACTTAGG AGGCTGT^GT GAGAGGATTG CATGGCTCCA 300 
GGAAGTTGAA ACTGCAGTGA ACTGTGGTCA CGCTATTACA CTCCAGCCTG GGTGACAGAC 360 
TGAATCCCTG TCTCAAAAAG GAAAAGGAGG ATGGACTCAG ACAAGTTCTG GAGGAGATGA 420 
AAGCTTTGTA TGAACAAAAC CAGTCTGATG TGTTCTCTGT TAAGAAATCG ACGCTGCACT 480 
GTAGCATACC TGTATGACCG CTTGCTTCGG ATCAGAGCAC TCAGATGG 

Seq ID NO: 160 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

ATGTTCTGCG AAAAAGCCAT GGAACTGATC CGCGAGCTGC ATCGCGCGCC CGAAGGGCAA 
CTGCCTGCCT TCAACAATTA G 

Seq ID NO: 161 DNA sequence 
Nucleic Acid Accession #: U10694 
Coding sequence: 1333-2280 



1 11 21 31 41 51 

I I I I I I 

GGATCCGGCC GGATCTCAGG GAGGTGAGGA CTTTGTTCTC AGAGGGTGTG TGTGGACAAA 60 
ACAGGGAGGC CCTGTGTTCG ACAGACACAG TGGTCCCAGG ATTGGAGAGC AGTCCAGGTG 120 
AGGAACCTAA GGGAGGATCG AGGGTACCTC CAGGCCAGAG AAACTCTCAG ATCAAGAGAG 180 
TTTGCCCTGC CCCTACTGTC ACCCCAGAGA GCCCGGGCAG GGCTGTCTGC TGAGGTCCCT 240 
CCTTTATCCT GGGATCACTG GTGTCGGGGA GGGCTGGCCT TGGTCTGAGG GGGCTGCACT 300 
CACGTCAGCA GAGGGAGGGT CCCAGGCCCT GCCAGGAGTC CAGGTGCAGA CTGAGGGGAC 360 
CCCACTCACC AAACACAGAG GACCTAGCCC CACCCTGCCC CTTGTGTCAG CTGAGGGAAG 420 
CCGCTGGGTG GATGGACTCC CCTCACTTCC TCTTCAGGTG TCTCCTGGAG ATAGGGCCTC 480 
AGGTCAACAG AGGGAGGGTT CCAGACCCTG CAGGCATCAA GATGAGGACC AGGCAGTATC 540 
CTCACCCCAG GACACATGGA CCCCATTGAA TTTAGACATC TCTTACTGTA CTTCCGAGGA 600 
AACCCTGGGC AGGTGTGGGC AGATGTTGGT TGGGGCATGT CCTTCTGTTC CATATCAGGG 660 
ATGTGAGCTC CTGATCTGAG AGACTCTCAG GCAAGTAGAG GAGTAGAGTC CAGTCCCTGC 720 
CAGGAGAAAG GTCAGGGCCC TGAGTGAGCG CAGAGGGGAC CATCCACCCC AAAAGTGTGT 780 
AGAACTCT^G AGTGTCCAGC CCGCCCTCTT GACAGCACTG AGGGACCGGG GCTCTGCCTG 840 
CAGTCTGCAG CCTAAGGGCC CCTCGATTCC TCTTCCAGGA GCTCCAGGAA GCAGGCAGGC 900 
CTTGGTCTGA GACAGTGTCC TCAGGTCGCA GAGCAGAGGA GACCCAGGCA GTGTCAGCAG 960 
TGAAGGTGAA GTGTTCACCC TGAATGTGCA CCAAGGGCCC CACCTGCCCC AGCACACATG 1020 
GGACCCCATA GCACCTGGCC CCATTCCCCC TACTGTCACT CATAGAGCCT TGATCTCTGC 1080 
AGGCTAGCTG CACGCTGAGT AGCCCTCTCA CTTCCTCCCT CAGGTTCTCG GGACAGGCTA 1140 
ACCAGGAGGA CAGGAGCCCC AAGAGGCCCC AGAGCAGCAC TGACGAAGAC GTGTAAGTCA 1200 
GCCTTTGTTA GAACCTCCAA GGTTCGGTTC TCAGCTGAAG TCTCTCACAC ACTCCCTCTC 1260 
TCCCCAGGCC TGTGGGTCTC CATCGCCCAG CTCCTGCCCA CGCTCCTGAC TGCTGCCCTG 1320 
ACCAGAGTCA TCATGTCTCT CGAGCAGAGG AGTCCGCACT GCAAGCCTGA TGAAGACCTT 1380 
GAAGCCCAAG GAGAGGACTT GGGCCTGATG GGTGCACAGG AACCCACAGG CGAGGAGGAG 1440 
GAGACTACCT CCTCCTCTGA CAGCAAGGAG GAGGAGGTGT CTGCTGCTGG GTCATCAAGT 1500 
CCTCCCCAGA GTCCTCAGGG AGGCGCTTCC TCCTCCATTT CCGTCTACTA CACTTTATGG 1560 
AGCCAATTCG ATGAGGGCTC CAGCAGTCAA GAAGAGGAAG AGCCAAGCTC CTCGGTCGAC 1620 
CCAGCTCAGC TGGAGTTCAT GTTCCAAGAA GCACTGAAAT TGAAGGTGGC TGAGTTGGTT 1680 
CATTTCCTGC TCCACAAATA TCGAGTCAAG GAGCCGGTCA CAAAGGCAGA AATGCTGGAG 1740 
AGCGTCATCA AAAATTACAA GCGCTACTTT CCTGTGATCT TCGGCAAAGC CTCCGAGTTC 1800 
ATGCAGGTGA TCTTTGGCAC TGATGTGAAG GAGGTGGACC CCGCCGGCCA CTCCTACATC 1860 
CTTGTCACTG CTCTTGGCCT CTCGTGCGAT AGCATGCTGG GTGATGGTCA TAGCATGCCC 1920 
AAGGCCGCCC TCCTGATCAT TGTCCTGGGT GTGATCCTAA CCATU^GACAA CTGCGCCCCT 1980 
GAAGAGGTTA TCTGGGAAGC GTTGAGTGTG ATGGGGGTGT ATGTTGGGAA GGAGCACATG 2040 
TTCTACGGGG AGCCCAGGAA GCTGCTCACC CAAGATTGGG TGCAGGAAAA CTACCTGGAG 2100 
TACCGGCAGG TGCCCGGCAG TGATCCTGCG CACTACGAGT TCCTGTGGGG TTCCAAGGCC 2160 
CACGCTGAAA CCAGCTATGA GAAGGTCATA AATTATTTGG TCATGCTCAA TGCAAGAGAG 2220 
CCCATCTGCT ACCCATCCCT TTATGAAGAG GTTTTGGGAG AGGAGCAAGA GGGAGTCTGA 2280 
GCACCAGCCG CAGCCGGGGC CAAAGTTTGT GGGGTCAGGG CCCCATCCAG CAGCTGCCCT 2340 
GCCCCATGTG ACATGAGGCC CATTCTTCGC TCTGTGTTTG AAGAGAGCAA TCAGTGTTCT 2400 
CAGTGGCAGT GGGTGGAAGT GAGCACACTG TATGTCATCT CTGGGTTCCT TGTCTATTGG 2460 
GTGATTTGGA GATTTATCCT TGCTCCCTTT TGGAATTGTT CAAATGTTCT TTTAATGGTC 2520 
AGTTTAATGA ACTTCACCAT CGAAGTTAAT GAATGACAGT AGTCACACAT ATTGCTGTTT 2580 
ATGTTATTTA GGAGTAAGAT TCTTGCTTTT GAGTCACATG GGGAAATCCC TGTTATTTTG 2640 
TGAATTGGGA CAAGATAACA TAGCAGAGGA ATTAATAATT TTTTTGAAAC TTGAACTTAG 2700 
CAGCAAAATA GAGCTCATAA AGAAATAGTG AAATGAAAAT GTAGTTAATT CTTGCCTTAT 2760 
ACCTCTTTCT CTCTCCTGTA AAATTAAAAC ATATACATGT ATACCTGGAT TTGCTTGGCT 2820 
TCTTTGAGCA TGTAAGAGAA ATAAAAATTG AAAGAATAAT TTTTCCTGTT CACTGGCTCA 2880 
TTTTTTCTTC AGACACGCAC TGAACATCTG TTATTCGGAA CACCCTGGGT T 






Seq ID NO: 162 Protein sequence: 
Protein Accession #: AAA68877.1 



1 11 21 31 41 51 

I I I I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 

PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120 

HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180 

LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240 

PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300 
PSLYEEVLGE EQEGV 

Seq ID NO: 163 DNA sequence 
Nucleic Acid Accession #: AF292100 
Coding sequence: 30-809 

1 11 21 31 41 51 

I I I I I I 

GGGGGGGGAG AGGCCTGGAG GACACCAACA TGAACAAGTT GAAATCATCG CAGAAGGATA 60 

AAGTTCGTCA GTTTATGATC TTCACACAAT CTAGTGAAAA AACAGCAGTA AGTTGTCTTT 120 

CTCAAAATGA CTGGAAGTTA GATGTTGCAA CAGATAATTT TTTCCAAAAT CCTGAACTTT 180 

ATATACGAGA GAGTGTAAAA GGATCATTGG ACAGGAAGAA GTTAGAACAG CTGTACAATA 240 

GATACAAAGA CCCTCAAGAT GAGAATAAAA TTGGAATAGA TGGCATACAG CAGTTCTGTG 300 

ATGACCTGGC ACTCGATCCA GCCAGCATTA GTGTGTTGAT TATTGCGTGG AAGTTCAGAG 360 

CAGCAACACA GTGCGAGTTC TCCAAACAGG AGTTCATGGA TGGCATGACA GAATTAGGAT 420 

GTGACAGCAT AGAACAACTA AAGGCCCAGA TACCCAAGAT GGAACAAGAA TTGAAAGAAC 480 

CAGGACGATT TAAGGATTTT TACCAGTTTA CTTTTAATTT TGCAAAGAAT CCAGGACAAA 540 

AAGGATTAGA TCTAGAAATG GCCATTGCCT ACTGGAACTT AGTGCTTAAT GGAAGATTTA 600 

AATTCTTAGA CTTATGGAAT AAATTTTTGT TGGAACATCA TAAACGATCA ATACCAAAAG 660 

ACACTTGGAA TCTTCTTTTA GACTTCAGTA CGATGATTGC AGATGACATG TCTAATTATG 720 

ATGAAGAAGG AGCATGGCCT GTTCTTATTG ATGACTTTGT GGAATTTGCA CGCCCTCAAA 780 

TTGCTGGGAC AAAAAGTACA ACAGTGTAGC ACTAAAGGAA CCTTTTAGAA TGTACATAGT 840 

CTGTACAATA AATACAACAG AAAATTGCAC AGTCAATTTC TGCTGGCTGG ACTGAACTGA 900 

AGATCJ^ATCC TCACAATTCA GACTGAGGGT TGAGACAAAA CTTTAAGGAT ACATCTTGGA 960 

CCATATCGTA TTTCATTCTT CTAATGGTGG TTTGGGCTTG TCTTCTAGTC TGGGCCGCTC 1020 

TAAACATTTA TAATTCCAAC ATTGTGGATT TCATCTTATA TCTGTGGACC ATCCTAGTTT 1080 

ATTCTCCCAT AAGTCTTAGA AGCTTTATGG TGATTATTTT GAGGTTTTCA TTCTCGCATA 1140 

AAGCACAATG CTGTCTTCAT CAGAAAACAG TTGGCATAAG AATTAAACAT ATGAACATCA 1200 

CAAAACAATT TATAAAAAGT TCTTAAATAT ACGCTTTGGG CTAGTTGCAA AGACTATGCT 1260 

AATAGCACTT CCAGTGAGAG TGATATATTT AAGTGTACTG GATCTGGAAT GGTGTTTTGG 1320 

TTTGGGGGGA ATTTTTTTTT TTTCCTGGCA AATCACATAT GTTGTTGATG TGAGTATCTG 1380 

ATGAAAAAAC AATGTCAGAA TAACCGACAT GAAAATTTTT TAGGATAACT TGGTGCCTAC 1440 

CTGAAAAATG TATTGTGTTT TAGACTCTTG ATTTCAAAAG GTTCCACAGA ACTAGTCTGC 1500 

GCTTACCTTA CCCATGTTTA TATATAGCTG TCCTACAGGG AGCTTTTATT TAGAAAATGT 1560 

CTGCATAATG TTAGATTCTT CTCCTGTCTA CATTATGCAC TACATAATTG GACTTCATTA 1620 

TGCTTTTGAA ATGCTTATCT GCCTGTCACA TAAGTTAAAC TATTTAATTT GTTTTGAATG 1680 

TTTTGGATTG CTACAC/^ATA CAATATTCTA AATTTAGGCA TGAGGGTTTT TTTGTTTTAT 1740 

TTTTACTTTT TTTTTGTCAT TGCACTATGG AACACAAATG AAATTCTCTT AATTTATAAG 1800 

AAGATAGTAG GAGTTAAATT TTGAAAATGG TTGTGATGAG CCACGAAATT CAATCTTTAT 1860 

AATATAGGTA CTGCTCTTTC AGACAAACAG TCCATTTTTA ATGACTTCTT ATTTTGTTGA 1920 

AATTACTTTA ACTGCTAATC ACTGTGGTTG CCAAATATTT ACTTCAGAAG CAAAGATTTT 1980 

CAAACAAGCA TACACGATGC AAAATACCAG TCTGGCTTCT AGTCTATTTA CTGTTTTGTT 2040 

TCACTCAGAT TAGCTCAGTT TTCTCATCAA AGCAGAATGC TATCTTGCGT GTGTGTGTGT 2100 

GTGTGTGTGT GTGTGTGTGT GTATGTGTGT ATATATATAT ATATATATAT ATATATATTT 2160 

^^^^^^^^^^ ATTACAAAAG CCATGAGCTG CTTTTATGCT GAAAATGGTC 2220 

ATTTCCCTGT TCACTTACTG ACATGTGAAG AAGGGTTTCT TGCTTTCTTA AACATTTCCG 2280 

TAAGGCAGGC TAGAAATGTA ATACTTCAAA TGTTTGATGA TTATGGTCTT TTGATAGGAA 2340 

TAGATTCTGC TTGGGATATA TATCCAGGCA CTCTCTAAGG TCTAGGGTTG ATATTAACAA 2400 

AGGAATGTAC TTAGAATAGC AGTACATTTT ATGCAAATAT GGAAATTATT TTAAGAAACA 2460 

ATGACATATC AAAACTGCTT TTTACATGAT TTTGAAATAG ACTAGAAAGC TTTCCCTATA 2520 

GACATATTAA TATTCCAATC ATAACTTTAA TTCAAGAATG CAGTTTTACC AAAAGAAAAA 2580 

TTTGAAAATT TCTATTCAGG CTACTGGAAT TGGTTATTAA AAGAAAAAGG AAAAAGAAGA 2640 

ATCTTGCTGC TTTCAGTATT TCCTGATTTT TTTGTAAATA TAAAGAGGAA CTTCAATTAT 2700 

GAAAAATTTT TTVAAAGATAT ATATATCTAT ATATCTATAT ATATGTACTG TTTTGTTTCC 2760 

TGTCTTGAAG ATTTTGAGTT ATGGTTATTG GTTTCAGATT GATTAATTCA CATATGCTGT 2820 

GTTTTCTTTA AAAGTCATAT GGGTTCGTGG CCTAATGCCT TGGATTTTAC ATATTTTTCT 2880 

TTTTAAATGC AAAACCTTTT CAACAAAATA GTGTTTGTCA TCAGGTTGGT ACTAAACATT 2940 

TATAATTACT GTGTAATTAT AAACAAAAAT ACATAAAGCT TTGAATATAA TTATGTAGCA 3000 

TAAAAGTTAA GGTTGTTCAC TATGATGGCA TCTTAGAATT AAACAAAACT TTTACTAGGG 3060 

CTGAAAAGAG AAGACTGATT TAATGTGGTG TGATTATTCT GAAGATAAAT GTCTGGCTAC 3120 

AGGGAATATT TTGTACTAAA AAATGATTAC ACATATGGCT GTGTGTGTTT GAGTCTGTGT 3180 

CTGTGAGAGA GCCAGAGAGA GTGAGAGAGA TTGACAGAGA AAGGGAGAGA CACACACACG 3240 

CCCCTTGAAT TGCTTTAACT CCTAAGTGTT TCAGTCCTCA TTCCGGTAAA CTCCCCATGC 3300 

TGATTCTTTG TTTTAAACTG AACCATAGGT ACAGTTTCCT TTTTGCCAAA TGTCAAAACA 3360 

GGTACAAATT TTAAAATGTA ATGCTTTTTA AATAGAAAAA TGTATAAAAT TAGAAGTGCC 3420 

CACATATATiA AAATACTTGA GATGAAGATT ATCTTTAGTG AATATCATCT GCATATCTCT 3480 

GTAAGTTCAA TTGTGTTTCT TACAGTCCCT GTCATATTAC CAACAGAGGC AATAAAAGCT 3540 
GCAGTGAAAT TG 






Seq ID NO: 164 Protein sequence: 
Protein Accession #: AAG00606 



1 11 21 31 41 51 

I I I I I I 

MNKLKSSQKD KVRQFMIFTQ SSEKTAVSCL SQNDWKLDVA TDNFFQNPEL YIRESVKGSL 60 

DRKKLEQLYN RYKDPQDENK IGIDGIQQFC DDLALDPASI SVLIIAWKFR AATQCEFSKQ 120 

EFMDGMTELG CDSIEQLKAQ IPKMEQELKE PGRFKDFYQF TFNFAKNPGQ KGLDLEMAIA 180 

YWNLVLNGRF KFLDLWNKFL LEHHKRSIPK DTWNLLLDFS TMIADDMSNY DEEGAWPVLI 240 
DDFVEFARPQ IAGTKSTTV 






WO 02/086443 



Seq ID NO: 165 DNA sequence 
Nucleic Acid Accession #: AF256215 
Coding sequence: 220-2028 

1 11 21 31 41 51 

I I I I I I 

CTCCAGTCCG CATGCTCAGT AGCTGCTGCC GGCCGGGCTG CGGGGCGGCG TCCGCTGCGC 60 

GCCTACGGGC TGCGGTGGCG GCCGCCGCGG CACCCGGCAG GGCCCGCCAG TCCCCGCTTC 120 

CCTGCTCCAG AGCCGCCGCC TGGGCCGGGG CAGGGCGGGC CCGGGGCTCC TCCATGCTGC 180 

CAGCCGCCGG GCTGCGGAGC CGACCAAGTG GCTCCTGCGA TGGCGGCGGA AGAGGAGGCT 240 

GCGGCGGGAG GTAAAGTGTT GAGAGAGGAG AACCAGTGCA TTGCTCCTGT GGTTTCCAGC 300 

CGCGTGAGTC CAGGGACAAG ACCAACAGCT ATGGGGTCTT TCAGCTCACA CATGACAGAG 360 

TTTCCACGAA AACGCAAAGG AAGTGATTCA GACCCATCCC AAGTGGAAGA TGGTGAACAC 420 

CAAGTTAAAA TGAAGGCCTT CAGAGAAGCT CATAGCCAAA CTGAAAAGCG GAGGAGAGAT 480 

AAAATGAATA ACCTGATTGA AGAACTGTCT GCAATGATCC CTCAGTGCAA CCCCATGGCG 540 

CGTAAACTGG ACAAACTTAC AGTTTTAAGA ATGGCTGTTC AACACTTGAG ATCTTTAAAA 600 

GGCTTGACAA ATTCTTATGT GGGAAGTAAT TATAGACCAT CATTTCTTCA GGATAATGAG 660 

CTCAGACATT TAATCCTTAA GACTGCAGAA GGCTTCTTAT TTGTGGTTGG ATGTGAAAGA 720 

GGAAAAATTC TCTTCGTTTC TAAGTCAGTC TCCAAAATAC TTAATTATGA TCAGGCTAGT 780 

TTGACTGGAC AAAGCTTATT TGACTTCTTA CATCCAAAAG ATGTTGCCAA AGTAAAGGAA 840 

CAACTTTCTT CTTTTGATAT TTCACCAAGA GAAAAGCTAA TAGATGCCAA AACTGGTTTG 900 

CAAGTTCACA GTAATCTCCA CGCTGGAAGG ACACGTGTGT ATTCTGGCTC AAGACGATCT 960 

TTTTTCTGTC GGATAAAGAG TTGTAAAATC TCTGTCAAAG AAGAGCATGG ATGCTTACCC 1020 

AACTCAAAGA AGAAAGAGCA CAGAAAATTC TATACTATCC ATTGCACTGG TTACTTGAGA 1080 

AGCTGGCCTC CAAATATTGT TGGAATGGAA GAAGAAAGGA ACAGTAAGAA AGACAACAGT 1140 

AATTTTACCT GCCTTGTGGC CATTGGAAGA TTACAGCCAT ATATTGTTCC ACAGAACAGT 1200 

GGAGAGATTA ATGTGAAACC AACTGAATTT ATAACCCGGT TTGCAGTGAA TGGAAAATTT 1260 

GTCTATGTAG ATCAAAGGGC AACAGCGATT TTAGGATATC TGCCTCAGGA ACTTTTGGGA 1320 

ACTTCTTGTT ATGAATATTT TCATCAAGAT GACCACAATA ATTTGACTGA CAAGCACAAA 1380 

GCAGTTCTAC AGAGTAAGGA GAAAATACTT ACAGATTCCT ACAAATTCAG AGCAAAAGAT 1440 

GGCTCTTTTG TAACTTTAAA AAGCCAATGG TTTAGTTTCA CAAATCCTTG GACAAAAGAA 1500 

CTGGAATATA TTGTATCTGT CAACACTTTA GTTTTGGGAC ATAGTGAGCC TGGAGAAGCA 1560 

TCATTTTTAC CTTGTAGCTC TCAATCATCA GAAGAATCCT CTAGACAGTC CTGTATGAGT 1620 

GTACCTGGAA TGTCTACTGG AACAGTACTT GGTGCTGGTA GTATTGGAAC AGATATTGCA 1680 

AATGAAATTC TGGATTTACA GAGGTTACAG TCTTCTTCAT ACCTTGATGA TTCGAGTCCA 1740 

ACAGGTTTAA TGAAAGATAC TCATACTGTA AACTGCAGGA GTATGTCAAA TAAGGAGTTG 1800 

TTTCGACCAA GTCCTTCTGA AATGGGGGAG CTAGAGGCTA CCAGGCAAAA CCAGAGTACT 1860 

GTTGCTGTCC ACAGCCATGA GCCACTCCTC AGTGATGGTG CACAGTTGGA TTTCGATGCC 1920 

CTATGTGACA ATGATGACAC AGCCATGGCT GCATTTATGA ATTACTTAGA AGCAGAGGGG 1980 

GGCCTGGGAG ACCCTGGGGA CTTCAGTGAC ATCCAGTGGA CCCTCTAGCC TTTGATTTTT 2040 

AACTCCAAAA ATGAGAAACA TTTTAAAGCA TTATTTACGA AAAAACTGTC TCAACTATTC 2100 

TTAAGTACTG TATTGATATT GTTTGTATCT TTTATTAATG TTCTACCACT TTTTATAGAT 2160 

TTGCATCTTC CTGTCACAGG GATGTGGGGA AATACGTTTT CCTCCCAAGA GAACCAAGTT 2220 

TATTATAGAC TCCTTTATTC AGTGAAATGG CTTATAATCC ACTAGTTGCC ATATTTTTGC 2280 

TAAAATATTT CTAACCAAGA ATACTACTTA CATATTGTTT TGGCTTTGTT TTATTTTTGA 2340 

TGCAGTTTTT TTTAGTTGAG GTAATGTAAT ATATTGATGT TTTCCTTTGT GTCTAAGATT 2400 

GATTTATAAT AGTAGGTTTG TATAATTTGG AACATTTTCC ATGCCTTGCG AATTTCCTTA 2460 

ATTGAGGATA GGGCTTACAC ACTTTAAGAA AACAGTGAGT ACTTGAACAT TTAAAGGGAC 2520 

AGTGCAATTT ATAGTCATAA TCACATTGAA TACTGTATTT GATCTTTGGA GACTTAGGCA 2580 

AGCACAGAGC TGGGATATTT ATGCTCAGTT GAGCACTTTA AGATGAATTT TAAGTGAGAT 2640 

GATTTCTTGC TTAAAACTCA GAAAGTCAAA AGAGTTTCAG CTTTCCTTAC AGAAAAGGAA 2700 

GGATCTTGGG CCCTAGATCT TGGGGATTAA CCTCTGCATA TAAGATTTAC TCTTAATAGG 2760 

CCAGACGTGG TGCTCACGCC TGTAATCCCA GTACTTTGGG AGGCTGAGAC GGGCAGATCA 2820 

CTTGAGGTCA GGAGTTCAAG ACCAGCCTGG CCAATATGGT GAAACCCCGT TTCTACTAAA 2880 

AATACAAAAA AAATTACCCA GGCACTCACT CTTGAGGTAA CTAACCAACT CCCACGATAA 2940 

TGACAGTCCA TTCATGAGCG CAAAGGCCTC ATGACCTAAT GGCACACACC TGTAATCCCA 3000 

ACTGCTTGGG AGGCTGAGGC GAGAGGATTG CTTGAACCTG GGAGGCAGAG GTTGCAGTGA 3060 

GCCGAGATCG CACCACTGCA CTCCAGTCTG GGCAACAGAG TGAGACTTCA TCTCAAAAAA 3120 

AGTAAAAAAA AAGATTT7AT ATAATCACTG AAGATCTCTA TTATAGATAG ATTAGGTTTT 3180 

TGACATTGGA AACATACTTA GGGATAGATT TGTCCTAAAG GAAAAAAGTA GGCCCGGGCA 3240 

GATTAAATGT CTTGTGTAAA GTCACACATT AAATTCAGTC ACACATTAAA TTCATAGAGT 3300 

TTTAAATGTT TAATGTATAT AAACCAGTTT CTTTATACAC ATTTGGGAAA ACATTGGTCT 3360 

CACAGATTAA ATGATTAACT AACTGACCCA GGAACTAGTT GTAGCTTTCT AAGTAATTAG 3420 

GCAATTACAG TTATTGCCTG TAACCAAAGG TAATAAAACA AAATGACAAG TACATGTTTA 3480 

AAATTATGAG GCAATGAGAA ATAATTTAAA AACCAATTTT CTAGTTATAA TTTAAAATTT 3540 

GGAGAGCATT TTTAACAGTA ATTAATCCAG AGGTGGCTCA AATTGAGTAT AAGAATTAAG 3600 

ATTATTTAAA ATACTGCATG TCTACCTTCT CGGGGATCAT ACTTTATAAC ACTTTCTGCT 3660 

TCAGTAGCTC TTCATAGCTT GCCAAGTATG CTCCCATATT TTCTCTCTCG TGCCTCGCAA 3720 

ATGAAAGTCA GATAGGCTGG GAACTCATGG GGCAGCCCTC AGACTTCAAT GTGGGCTTCA 3780 

AATCCAGTTT CCTGTTCTAT ATGGTGCTAC ATCTTTCCAG AAAATTTCCC TCAGAGCCCC 3840 

TCGCCAAAAC TUIAGCATTAT TTTGACCCTG CATGCTATTT CTTTAGCTGT AGGTGATAGA 3900 

TTAGAACTTC TGTCAGACAT GTTAATGACA AACATACCAA CAGACAATAA CCTUWVGCAAA 3960 

TGTTTCCTTC AAGTGTGAAA TGTGCAGGGG CTCGTGGGCA AGGATGTATT GGCACACTGT 4020 

CCTCTTGAAC TGATAGTGTC CCAGCAATGT TGGAGGTTGG CACCATTCCT GGTCCGACAC 4080 

TTGAGGACCT GAGAGACATC AGGTTTAGAA TGAGCCAAAG AAATCCTACA AGATGGGGAG 4140 

AATTGGTGTG CAGCAGCCTA AGTGTTATAG TTAAGTCTAA AGAAGTATGA AAGATCCCCT 4200 

GTGTTCTCTA AATTGAGCAG AGGGGCCTGC CTACCAATAT CACTTTTTAG GGGACTGAAC 4260 

CATTGCAGGT TAGACTTGGC TTCCAAAGAG TCTGCCTAAG CCAGGGGTGG CAGGGTAGGC 4320 

CATCATAGCT GGATGGCCTC AAAAGCAGAT GGGGGCAGAC TTGCCCTCGT GATGCCAGGA 4380 

TTTGAGAGGC AGAGTTTCTA GAGGGAGACC AGTGCTGCCT CTCACAGTGG CAGTTTTTTC 4440 

TCTTTGCAAG AGGAGGGGCT GTTCAATTCC ATAGACCAGT GGGCAGATAG CCAGTTGAAT 4500

ACTCTGTGCA TGGTTTGATC CTTTATTAGT TCGCTCTAAT ATTTTTCTGT AGATCCTTTT 4560 

GTCCTGGACT CAAAATCTAA TCCATGCATT GTATGATACC GTAGCTCTCC TAAGGTTTGT 4620 

GTTTCCTTCA AAATGTTTTA GTTTTCTTCA ACTAAATTTG ATTTTTGCTG TTAGAAGTGA 4680 

CATATTTTTA TGGTATACAC TATGTTCCTT TTTTCTACTG CGA6TCAATT TTTTGAATTT 4740 

TCGTGAGAAA GAATATATCT ACAAATT6CA CGAAAGTATC ATAAAAACAG TACTCTAGAG 4800 
CAGCGCTGTC CAATAGAAAT ATAATCTGAG CCACATGTAT AATTTTATTT TCTTCTAGCC 4860

ACATTAAAGA AGTAAAAAGA TACAAGTAGA ACTAATTTTA ATGTTTTAAT TCAGTATATC 4920 

CAAAATATCA TTTGT^CATG TAATTAATAT AAAATTATTA ATGTGATATT TTACATTCTT 4980 

TTGGTAATAC TAGTCTTCAA AATCTGGTAT GTATCTTACA TTGATAGCAC ATCTCACTTT 5040 

GTACTAGCCA CATTGCAAGT GCTCAGTAGC CACATGTGGC TAGTGGCTAC TGCACTGGAC 5100 

AGCACAGTTC TAGGTTCCAC GCTAACACCG AAGTCCTGTG GATTAGAATC CCAGAATCAG 5160 

AGCTGGAAGT AAACATAGAG ATCAAACCTC CTTTTAAAAA TGAGGACGCT GAGGCACAGA 5220 

GTTTAAATGG CTTGCATGAG GTCATACAGC TAAATTCAGC CTCAACAGGG TCTTCTGATT 5280 

CCAGGCACTC TTCCCACTCC ACTACATTAC TGTAGTGGTA ATTCTTAGGG TTAAAAAAAG 5340

TGTAGAGTAG GCCGGGCGCA GTGGCTCAT6 CCTGTAATCC CAGCACTTTG GGAGGCCGAA 5400 

GTGGGCGGAT CACGAGGTCA GGAGATCGAG ACCATCCTGG CCAACATGGT GAAACCCCGT 5460 

CTCTACTGAA AATACAAAGC AAAATTAGCC AGGTGTGGTG GCGGGCGCCT GTGGTCCCAG 5520 

CTGCTCTGGA GGCTGAGGCA GAATGGCGTG AACCCAGGAG GCAGAGATGG CAGTGAGCCA 5580 

AGATCGCGCC ACTGCACCCC AGCCTGGGCX3 ACAGAGCX3AG ACTCCATCTC AAAAAAAAAA 5640 

AAAAAAAAAA AAGAAAAGAA AAGATUUVGTC TAGAGAACAT TATATTAAGT GGTTATTATT 5700 

GAAGTAGACC AAAGTTTATA CCATAAGGAT ATTTTTCCTT AAATACCATG TTTGAAGAAC 5760 

AATTATTTAT TGATCCTTGA ATCTGTAAGA TCAAATAACA AGTCTCTATC CATGTTACCA 5820

AATTTAACCT TTTGAAAATA ATAAACTTTA AAATATCAGA TGTGTTATTA CAGGATGATA 5880 

CTTGGAATCA AGTGAAATGA GTTATATGGT CATCACTAAA TTTAGAAATC TATTGTGAAA 5940 

CAAAGACAAA CAGGAAAGTA CAGAATAGAG ACTTTTAGTA AATAAATGGA ATTTAAAAGA 6000 

AAGTGTTTAT TTACAGTGTC ACGACAGAAA AGGATGTCTT TGTTGTCATA GTCTTTGAGG 6060

GATCTCCGTA AAATCTGGGG CACAGGTACA AGAAATAGCC AATATTTAGT TCCCAGACCA 6120

TGTTTAGTAG TGTCCAGTTT CAGATCATGC TGCCAAGAGG TATCTCCCCC TCAGGTGGGT 6180 

CATCACTGAG CCCTGGAATT GGAGACTCAT ACTTGCCCAG CACAATGTTA CGGGCAGACA 6240 

GGCCGACATC TATGATTAGC TAGAAGCCAT AAAGAAAAGC TGCTAAGTGG CCACTAGGTG 6300 

CCACTTTTCT GTTTTTGTAA TGCTTTCATT AGCAGATCTT TTTTTTCCAA GCTCCATGGG 6360 

GCCTATGAGA GGCATTTATG ATTTTTGTGC CTACAATAAG TCAGCCTGTC TGGTGTGAGT 6420 

TGTTTTATGA GAAATGCTTT CCAAGGGAGG TCTAGGAAGA TCCTGACACA TAAGAACTTT 6480 

GGCTTAGAGA GCTTTCCAGG TGTAGTGCCA ATA7VAAACTG ACCTGGAAAG AAAACCTGCC 6540 

CAGCACGGAA CATGCTTTCT GAACTCACTT GAGAGTGTAT GGTGTATGTC ACTTCTCATA 6600 

TATTCTTGAG TTTAGATTTG TCTTTTATAC AATTTTTAGC TCTTTTCCAG TTCACTTGTG 6660 

CTCGTCTGTA TATTGGTATT TTTAAATTTT TGTGGTAAAT AATGAAAAGA GTGAAATTAT 6720 

ATTTTATAAT TACTCATTTG TAGTTTTTTT .TTTTAATTTA ATAAACTTCC TCCAAAAAGT 6780 
GCTCCCTTAA AA 



Seq ID NO: 166 Protein sequence: 
Protein Accession #: AAG34652 

1 11 21 31 41 51

I I I I I I

MAAEEEAAAG GKVLREBNQC lAPWSSRVS PGTRPTAMGS FSSHMTBFPR KRKGSDSDPS 60
QVEDGEHQVK MKAFREAHSQ TEKRRRDKMM NLIEELSAMI PQCNPMARKL DKLTVLRMAV 120
QHIiRSLKGLT NSYVGSNYRP SFLQDNEIjRH LIIiKTAEGFL FWGCERGKI LFVSKSVSKI 180
IiNYDQASLTG QSLFDFLHPK DVAKVKEQLS SFDISPREKL IDAKTGLQVH SNLHAGRTRV 240
YSGSRRSFFC RIKSCKISVK EEHGCLPNSK KKEHRKFYTI HCTGYLRSWP PNIVGMEEER 300
NSKKDNSNFT CLVAIGRLQP YIVPQNSGEI NVKPTEFITR FAVNGKFVYV DQRATAILGY 360
LPQELLGTSC YEYFHQDDHN NLTDKHKAVL QSKEKILTDS YKFRAKDGSF VTLKSQWFSF 420
TNPWTKELEY IVSVNTLVLG HSEPGEASFL PCSSQSSEES SRQSCMSVPG MSTGTVLGAG 480
SIGTDIANEI IiDLQRLQSSS YIjDDSSPTGL MKDTHTVNCR SMSNKELFPP SPSEMGELEA 540
TRQNQSTVAV HSHEPLLSDG AQLDFDALCD NDDTAMAAPM NYLEAEGGLG DPGDFSDIQW 600
Tli

Seq ID NO: 167 DNA sequence
Nucleic Acid Accession #: NM_014400
Coding sequence: 86-1126

1 11 21 31 41 51

I I I I I I

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120

GATCTGGACT 6CAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840

CACATCTGTC ACCACTTCTA CCTCQGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200

CCCACGACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680

TTGTATAGTG AfiAAAAAA

Seq ID NO: 168 Protein sequence:
Protein Accession #: NP_055215



PCT/US02/12476 






1 11 21 31 41 51

I I I I I I

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120

SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPVVS CYNASDHVYK GCFDGNVTLT 180

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300

AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL







Seq ID NO: 169 DNA sequence 
Nucleic Acid Accession #: NM_006875 
Coding sequence: 186-1190 



GAATTCGGCA 

caccagtttc 
cccgggcgtc 
cctccatgtt 

CGCCAGGAGG 
GTAAGGGGGG 
CCATCAAAGT 
CATGCCCACT 
TGATCCGCCT 
CTTTGCCCGC 
CAAGCCGCTG 
TTGTCCATCG 
TUICTCATTGA 
GGACAAGG6T 
CCACTGTCTG 
AGAGGGACCA 
GCTGTGCCCT 
AGAtCCTGCT 
AAAGGAGGCC 
TGGCCCCCAA 
GTTGACTTGG 
ATTGAGGATC 
CAAAGGAGCC 
CTCATTTTGC 
TTATTTTGAT 
TCATATGCTT 
AGTAAAGGGA 
TCAGCCCAGG 

CTGGTGAGAA 
CACCACCAGA 
GCTTGCTGTT 
CTGAGCCGGG 
TCCAAGTGTG 
CCACTATTTA 



11 
I 

CX3AGCGCGCG 
TCTGCTTTCC 

CACGCCCTGC 
GACCAAGCGT 
CAAGGATCGG 
CTTTGGCACC 
GATTCCCCGG 
CGAAGTCGCA 
GCTTGACTGG 
CCAGGATCTC 
CTTCTTTGGC 
TGACATCAAG 
TTTTGGTTCT 
6TACAGCCCC 
GTCACTGGGC 
GGAGATTCTG 
AATCCGCCGG 
GGACCCCTGG 
CTGCCCCTTT 
TGGTCAGAAG 
TTTTACAGGT 
AGGGGTTAGA 
TTCCTCCCAG 
TAAGGAAGTT 
GATGTGTCAC 
TTACTTGGGC 
CCCTTTCCCC 
ATTTTTTATT 
TTTTTTTTTG 
6AACCTTAAT 
CAATAGGATG 
TGTTTTCCTG 
ATTGTCCAAT 
CCCTCCTTTT 
ATAAAAGTAA 



21 . 
I 

GCGAATCTCA 
ACCCTGGCGC 

GGGCTTAGCG 
CTACAGGGGC 
GAAGCGTTCG 
GTCTTCGCAG 
AATCGTGTGC 
CTGCTATGGA 
TTTGAGACAC 
TTTGACTATA 
CAAGTAGTGG 
GATGAGAACA 
GGTGCCCTGC 
CCAGAGTGGA 
ATCCTCCTCT 
GAAGCTGAGC 
TGCCTGGCCC 
ATGCAAACAC 
GGCCTGGTCC 
AGCCATCCCA 
CATTACCAGT 
AGACATAAAC 
AACCTGTGGT 
TATTTTGGTG 
CCCACATTGG 
AAGGGTGCTT 
TAGCCTAGGG 
TTGGGGGAGG 
GGTGAGGGGA 
TCCATAATTT 
GGATGGATGG 
GGGCGCTCCC 
TACTAAAATG 
TTTTCCTGCC 
TAGAATCAGA 



31 

I 

AOGCTGCGCC 
CCCCCAGCCC 

GGTTCAGTGG 
CTCCCGCGCC 
AGGCCGAGTA 
GACACCGCCT 
TGGGCTGGTC 
AAGTGGGTGC 
AGGAAGGCTT 
TCACAGAGT^ 
CAGCCATCCA 
TCCTGATAGA 
TTCATGATGA 
TCTCTCGACA 
ATGACATGGT 
TCCACTTCCC 
CCAAACCTTC 
CAGCC6AGGA 
TTGCTACCCT 
TGGCCATGTC 
CATTAAAGTC 
CAAGTTTGCC 
CCCTGATTTT 
AAGTTGTTCC 
CACCTCCTAC 
TCCTTCCAAT 
TCCCATATTG 
TAATGCCCTG 
CCCTACTTTG 
GGGAA6GAAT 
TTTTTTGGGG 
TCCAATTTTG 
TAAATAATCA 
TGGATTATTT 
AAAAAAAAAA 



41 
I 

GTCTGCGGGC 
TGGCTCCCCA 
GCTCAATCTG 
CCCCGGGACC 
TCGACTCGGC 
CACAGATCGA 
CCCCTTGTCA 
AGGTGGTGGG 
CATGCTGGTC 
GGGCCCACTG 
GCACTGCCAT 
CCTACGCCGT 
ACCCTACACT 
CCAGTACCAT 
GTGTGGGGAC 
AGCCCATGTC 
TTCCCGACCC 
TGTTACCCCT 
AAGCCTGGCC 
ACAGGGATAG 
CAGTATTACT 
CAGTTCCCTT 
GGAGGGGGAA 
CATTTTGAGC 
TACCACCACA 
ACCCCAGTAG 
GGTCAAGCTG 
TTGTTACCCC 
TTATCCCAAG 
GGAAGATG6A 
GATGGGCTAG 
CAGATTTTTG 
CGTATTGTGG 
AAAAAGCCAT 
AAAAAAAA 



51 

I 

GCTTCCGGGC 
GCTGCGCTGC 
CGCAGCGCCA 
CCCACGCCGC 
CCCCTCCTGG 
CTCCAGGTGG 
GACTCAGTCA 
CACCCTGGCG 
CTCGAGCGGC 
GGTGAAGGCC 
TCCCGTGGAG 
GGCTGTGCCA 
GACTTTGATG 
GCACTCCCGG 
ATTCCCTTTG 
TCCCCAGACT 
TCACTGGAAG 
CAACCCCTCC 
TGGCCTGGCC 
ATGGACATTT 
AAGGTAAGGG 
CCCAATCCTA 
CTTCTTGCTT 
CCCGGGACTC 
CAAACTTAGT 
CTTTTATTTT 
CTTACCTGCC 
AAGGCTTCTT 
TGCTCTTATT 
CACCACCGGA 
6GGAAATAAG 
CAACCTCCTC 
GGAGGGGAGT 
GTGTGGAAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 






Seq ID NO: 170 Protein sequence: 
Protein Accession #: NP_006866



MLTKPLQGPP 
KVIPRNRVLG 
PAQDLFDYIT 
IDFGSGALLH 
DQEIIiEAEIiH 
RPCPFGLVLA 



11 
1 

APPGTPTPPP 
WSPIiSDSVTC 
EKGPLGEGPS 
DEPYTDFDGT 
FPAHVSPDCC 
TLSLAWPGIiA 



21 
I 

GGKDREAFEA 
PLEVALLWKV 
RCFFGQWAA 
RVYSPPEWIS 
AtilRRCIiAPK 
PNGQKSHPMA 



31 
i 

BYRLGPLLGK 
GAGGGHPGVI 
IQHCHSRGW 
RHQYHALPAT 
PSSRPSLEEI 
MSQG 



Seq ID NO: 171 DNA sequence 
Nucleic Acid Accession #: NM_003646
Coding sequence: 89-2875



GCGGCGCGGA 
GCCCTCCGCC 
GAGCAGCGAC 
CGAGCCGGAC 
CGGGCACAGG 
CCCTGGGGCC 
GTCAGCGACA 
CGTTGGGGAG 
AGCCTGCAAG 
CTGTAAGCCG 
GCACCACTGG 
CCAGCAGAAG 



11 
I 

GCGGGCGTGC 
GGCCGGGGCT 
TCCGAGTCGG 
AAGGCGCCGC 
AAAGCCATCA 
CCGTGCAGCG 
TATGGGGAGC 
CAGTACTGTG 
ATTGTGGTGC 
TCCTTCCGTG 
GTACACAGAC 
TTCACCTTCC 



21 

1 

TGAGCCCCGG 
AGGGCCGGAT 
CTTCCGCCTC 
GGCGACTCAA 
CCAAGTCGGG 
AGTCAGAGCG 
ACATCTGGTT 
TAGCCAGGAT 
ACACGCCCTG 
AATCAGGCTC 
GACGCCAGGA 
ACAGCAAGGA 



31 
1 

CCGCCGGCCC 
GGAGCCGCGG 
GTCCAGCGGC 
CAAGCG6CGC 
CCTCCA6CAC 
GCAGATCCGG 
CGAGACCAAC 
GCTGAAGTCA 
CATCGAGCAG 
CAGGAAT6TC 
C6GCAAGT6T 
GATTGTGGCC 



41 
i 

G6F6TVFAGH 

RLLDWFETQE 
HRDIKDENIL 
VWSLGIIiLYD 
LLDPWMQTPA 



41 

1 

GGCATGGGCG 
GACGGTAGCC 
TCCGAGCGCG 
TTCCCGGGGC 
CTGGCCCCCC 
AGTACAGTGG 
GTGTCCGGGG 
GTGTCTCGAA 
CTG6AGAAGA 
CGCGAGCCAA 
CGGCACTGTG 
ATCAGCTGCT 



51 
I 

RLTDRLQVAI 
GFMLVIiERPL 
IDLRRGCAKL 
MVCGDIPFER 
EDVTPQPLQR 



51 
1 

TCTCCCGCGG 
CCGAGGCCCG 
ACGCCGGTCC 
TGCGGCTCTT 
CTCCGCCCAC 
ACTGGAGCGA 
ACTTCTGCTA 
GAAAGTGCGC 
TAAATTTCCG 
CCTTTGTAC6 
GGAAGGGATT 
CGTGGT6CAA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
GCAGGCATAC CACAGCAAGG TGTCCTGCTT CATGCTGCAG CAGATCGAGG AGCCGTGCTC 780

GCTGGGGGTC CACGCAGCCG TGGTCATCCC GCCCACCTGG ATCCTCCGCG CCCGGAGGCC 840 

CCAGAATACT CTGAAAGCAA GCAAGAAGAA GAAGAGGGCA TCCTTCAAGA GGAAGTCCAG 900 

CAAGAAAGGG CCTGAGGAGG GCCGCTGGAG ACCCTTCATC ATCAGGCCCA CCCCCTCCCC 960 

GCTCATGAAG CCCCTGCTGG TGTTTGTGAA CCCCAAGAGT GGGGGCAACC AGGGTGCAAA 1020

GATCATCCAG TCTTTCCTCT GGTATCTCAA TCCCCGACAA GTCTTCGACC TGAGCCAGGG 1080 

AGGGCCCAAG GAGGCGCTGG AGATGTACCG CAAAGTGCAC AACCTGCGGA TCCTGGCGTG 1140 

CGGGGGCGAC GGCACGGTGG GCTGGATCCT CTCCACCCTG GACCAGCTAC GCCTGAAGCC 1200 

GCCACCCCCT GTTGCCATCC TGCCCCTGQG TACTGGCAAC GACTTGGCCC GAACCCTCAA 1260 

CTGGGGTGGG GGCTACACAG ATGAGCCTGT GTCCT^GATC CTCTCCCACG TGGAGGAGGG 1320 

GAACGTGGTA CAGCTGGACC GCTGGGACCT CCACGCTGAG CCCAACCCCG AGGCAGGGCC 1380 

TGAGGACCGA GATGAAGGCG CCACCGACCG GTTGCCCCTG GATGTCTTCA ACAACTACTT 1440

CAGCCTGGGC TTTGACGCCC ACGTCACCCT GGAGTTCCAC GAGTCTCGAG AGGCCAACCC 1500 

AGAGAAATTC AACAGCCGCT TTCGGAATAA GATGTTCTAC GCCX3GGACAG CTTTCTCTGA 1560 

CTTCCTGATG GGCAGCTCCA AGGACCTGGC CAAGCACATC CGAGTGGTGT GTGATGGAAT 1620 

GGACTTGACT CCCAAGATCC AGGACCTGAA ACCCCAGTGT GTTGTTTTCC TGAACATCCC 1680 

CAGGTACTGT GCGGGCACCA TGCCCTGGGG CCACCCTGGG GAGCACCACG ACTTTGAGCC 1740 

CCAGCGGCAT GACGACGGCT ACCTCGAGGT CATTGGCTTC ACCATGACGT CGTTGGCCGC 1800 

GCTGCAGGTG GGCGGACACG GCGAGCGGCT GACGCAOTGT CGCGAGGTGG TGCTCACCAC 1860 

ATCCAAGGCC ATCCCGGTGC AGGTGGATGG CGAGCCCTGC AAGCTTGCAG CCTCACGCAT 1920 

CCGCATCGCC CTGCGCAACC AGGCCACCAT GGTGCAGAAG GCCAAGCGGC GGAGCGCCGC 1980 

CCCCCTGCAC AGCGACCAGC AGCCGGTGCC AGAGCAGTTG CGCATCCAGG TGAGTCGCGT 2040

CAGCATGCAC GACTATGAGG CCCTGCACTA CGACAAGGAG CAGCTCAAGG AGGCCTCTGT 2100

GCCGCTGGGC ACTGTGGTGG TCCCAGGAGA CAGTGACCTA GAGCTCTGCC GTGCCCACAT 2160 

TGAGAGACTC CAGCAGGAGC CCGATGGTGC TGGAGCCAAG TCCCCGACAT GCCAGAAACT 2220 

GTCCCCCAAG TGGTGCTTCC TGGACGCCAC CACTGCCAGC CGCTTCTACA GGATCGACCG 2280 

AGCCCAGGAG CACCTCAACT ATGTGACTGA GATCGCACAG GATGAGATTT ATATCCTGGA 2340 

CCCTGAGCTG CTGGGGGCAT CGGCCCGGCC TGACCTCCCA ACCCCCACTT CCCCTCTCCC 2400 

CACCTCACCC TGCTCACCCA CGCCCCGGTC ACTGCAAGGG GATGCTGCAC CCCCTCAAGG 2460 

TGAAGAGCTG ATTGAGGCTG CCAAGAGGAA CGACTTCTGT AAGCTCCAGG AGCTGCACCG 2520 

AGCTGGGGGC GACCTCATGC ACCGAGACX3A GCAGAGTCGC ACGCTCCTGC ACCACX3CAGT 2580 

CAGCACTGGC AGCAAGGATG TGGTCCGCTA CCTGCTGGAC CACGCCCCCC CAGAGATCCT 2640 

TGATGCGGTG GAGGAAAACG GGGAGACCTG TTTGCACCAA GCAGCGGCCC TGGGCCAGCG 2700

CACCATCTGC CACTACATCG TGGAGGCCGG GGCCTCGCTC ATGAAGACAG ACCAGCAGGG 2760 

CGACACTCCC CGGCAGCGG6 CTGAGAAGGC TCAGGACACC GAGCTGGCCG CCTACCTGGA 2820 

GAACCGGCAG CACTACCAQA TGATCCAGCX3 GGAGGACCT^G GAGACGGCTG TGTAGCGGGC 2880 

Seq ID NO: 172 Protein sequence: 
Protein Accession #: NP_003637

1 11 21 31 41 51 

I 1 I I I I 

MEPRDGSPEA RSSDSESASA SSSGSERDAG PEPDKAPRRL NKRRFPGLRL FGHRKAITKS 60 

GLQHLAPPPP TPGAPCSESE RQIRSTVDWS ESATYGEHIW FETNVSGDFC YVGEQYCVAR 120 

MLKSVSRRKC AACKIWHTP CIEQLEKINF RCKPSFRESG SRNVREPTFV RHHWVHRRRQ 180 

DGKCRHCGKG FQQKFTFHSK EIVAISCSWC KQAYHSKVSC FMLQQIEEPC SLGVHAAWI 240 

PPTWILRARR PQNTLKASKK KKRASFKRKS SKKGPEEGRW RPFIIRPTPS PLMKPLLVFV 300 

NPKSGGNQGA KIIQSFLWYL NPRQVFDLSQ GGPKEALEMY RKVHNLRILA CGGDGTVGWI 360

LSTLDQLRLK PPPPVAILPL GTGNDLARTL NWGGGYTDEP VSKILSHVEE GNWQLDRWD 420 

LHAEPNPEAG PEDRDEGATD RLPLDVFNNY FSLGFDAHVT LEFHESREAN PEKFNSRFRN 480 

KMFYAGTAFS DFLMGSSKDD AKHIRWCDG MDLTPKIQDL KPQCWFLNI PRYCAGTMPW 540 

GHPGEHHDFE PQRHDDGYLE VIGFTMTSLA ALQVGGHGER LTQCREWLT TSKAIPVQVD 600 

GEPCKLAASR IRIALRNQAT MVQKAKRRSA APLHSDQQPV PEQLRIQVSR VSMHDYEALH 660 

YDKEQLKEAS VPLGTVWPG DSDLELCRAH lERLQQEPDG AGAKSPTCQK liSPKWCFLDA 720 

TTASRFYRID RAQBHLNYVT EIAQDBIYIL DPELLGASAR PDLPTPTSPL PTSPCSPTPR 780 

SLQGDAAPPQ GEEIiIEi^AKR NDFCKLQELH RAGGDIiMHRD EQSRTLLHHA VSTGSKDWR 840 

YLLDHAPPEI LDAVEENGET CLHQAAALGQ RTICHYIVBA GASLMKTDQQ GDTPRQRAEK 900 
AQDTELAAYL ENRQHYQMIQ REDQETAV 

Seq ID NO: 173 DNA sequence 
Nucleic Acid Accession #: AF232772
Coding sequence: 1-1662 

1 11 21 31 41 51 

I I I I I I 

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60 

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACX3GGCTACC AGTTCATCCA CACGGAAAAG 120 

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180 

CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240 

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 300 

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480 

G6TGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540 

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600 

TTCJ\AGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGGAGG 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCOXSC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 
CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAA7\AACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620

AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GACATGGCCC CCAAGCAGAG 1680 

CGGGTAAAGT GCAATGGGTA AGGGAGGGAA GGGGAATGGA AGAGAAAAGA CAGGGTGGGA 1740 

GGGAGGAGGG AGTGCTGTGT TTTAGTCTCT TAATGGTCCA AAGGACAAAT CTAAAATGCA 1800 

AAGAACGGTG ATGTAGTATG GCCTGACAGC TCTGTTTAGA GGAGGCAACA CTGATCCCCC 1860 

AGATGCAGGG CTGCAGGGGA TTCTGTGTTT TCAGACTGCC TGTCTGCTTG CATCTGCACA 1920

TAGGCAGTAG CCTCCTCCTG GGCTCCAGAG GGCACTCAGA AGTTGTGCTA AACCAAGTTA 1980 

AGTCCCATTC AGTGGCAACT TGTGATAGGT ACCTGAGTGA CGGCAACCTG CGGAAGGAGG 2040

TTCTCCCAGC CCATCTGAAC ACAACCAGAG GTGGCAGGAG AATTTCTACT GAGCGAGGTG 2100 

GGCCGGTTAG TGTATGTCAC CCCCACCCCA CCCATAAGTA GTCATCAATG CAATAAGATT 2160 

GCGCGTGAGA TACAAGGCCC AGAAGCCTGA TCTTTGGGCA TCAGAAAACA GGGTCCAGGA 2220 

ATGGTGCTTT ATGTGAGATA CCCCACTCCA CATCAACATT CCT^GGGATGA GCCAAACCAG 2280 

CAGGGAGTTA GCACTGAACT GCTTTTAAAA GTGCACATTA AAAAGGAAAG TTTGCCAGGA 2340 

GGAACAAAGA GATTGTGGTG GTGCTAAAGG AGGCCATAAG CTACACAGAG GCCTTGGGTG 2400 

TTCCACCTGG AAACTGCTCA GACGTCTAGA TGGGTTCTTA GCTTGTCTGT GATCTCTGCT 2460 

GGGGAGATAA AAAGATTAAG CCCCAACATG TTCAGAAAAG AAGTGAAGTC TTGGGTATTT 2520 

TAACCTGTAT ACTCTTGAAT TCCTCTCAAA TTCAGCTCTG ATCTGAGGCT AAGACACACT 2580

CCCCACTTCA CTTTCTTCAA AGCCACATTT TTTGAGGTAT CACTGCAGTC ACCTCTTCTA 2640

CCCTCATCAT CATAGGTAAG GTTTTCAAGG TGGCAATTGG GGCGGAGCCC CGGCTTCTTA 2700

TAGAAGCTTC AGCAGGAGGC AAGCGTGTTC TCAGCACATA TGGGAACTAT GAGGAGCCTC 2760

TGATCAAATT GGCTACAATC TTGGAGCTGC TTGGACGGAT TCCTTGGCAG CCGGGTTAGC 2820 

ATGTGTGACT TTCAGGCTAC TGTTCTTGAC AATCATCTCC AATGGAAAGC TTTTCAGTGT 2880 

TCCCAAAGTG AACTCTCAAA TCCA7VAATGG TTATCTTTGA GACCATCCAT TCTCCTCAGT 2940 

GGCTTCTCCA GGGAATTCTT ACAGCCAAGT TGTGACAGTC ACTGCATTTG CCTGCTTCTT 3000 

TCCAGAAACC AAACTAGGAG ATGA/y^CTGG TTCCTACATC CTAAGGTTCT TGCTTTCTCT 3060

CTCATGCCTC CTGAGGCTGT TTTTGGCTGT TTTCCCTCTG CTGCTTTTGG GGAATGAGGG 3120 

GAAGCCATTT TCCAAGTGAC TTGCAATCCA GGCTGTTCTC AGCGTTTTGA GTTTAAAACC 3180 

TGGGATCCTG ACTAAGCCTT TGACTTAAGG GTTGCTTGCT TGCCCTCCAA ATGTCCTTTC 3240 

TCAT^GGGGC CAACTAACCC GTGCAGAACC AGCACTAAGG TGGACAGCAG ACAAGAGGGC 3300

AAGCCTCTAA TGTACCAAGT GCTTCCTACA AAGACGCAAG GTGTGCTCCG AACCACAGAT 3360

GGGCAAACCC TGGTGCTTTC CTTCATCTCC CACGAACTCA AGGGTTTTCC AAGTGTAGCT 3420

AACAGTTGCC ACATCACACA GACCTCCAGT TTCTGGTAAG ACTGCTGGTT GACATCAGAC 3480

CCAACCCATT GAAGGCTGGA AGGCAGCAGG CATTTGCTAA GGCAGCTGAT CCAGGCAATC 3540

GTTCTGCTGG CCAAGAAGTT AAACTATTTT GAGCATTAGA ATGGAGGAAA TCCGGTCAGC 3600

CAAGTGCAGA GTTCAGACTT CGCTAAGGGC TTGTTTTTCT TCAGCATTTA CTTGAAGATT 3660

AATGTAGGAT GACAGGCTCT CCTGGCTGTC CTACCATCAG CTCTGCCTTG CACTGTGGTC 3720

GTCAACTTTC CTCAAATCAA AAACAGGCAG GTACAGGTAG TGGGCTCACA ACGTTTGACC 3780

TCGACTGGTT TTTCTAAGTT ATTTTGTACA TTTTTCAGCA GCAAAACCAA ACTGGGTCTT 3840

CAGCTTTATG CGCGTTTCTT GCAAGGGAAG AGCCTTTATA CAATTGGACG CATTTTGGTT 3900

TTTCCTCATT GAGAATTCAA ATCCTCTTTT GTATTGTTTC TACAATAATT TGTAAACATA 3960

TTTATTTTTA CCTGCTTTTT tTTTTTTTTT TAATTTTCAG GTCAAGTTTT TTATACTGCA 4020 
CTTATTTGTC AAAATAAAGA TTCTCACAT 

Seq ID NO: 174 Protein sequence: 
Protein Accession #: AAF36984 

1 11 21 31 41 51 

I 1 I 1 1 1 

MPVQLTTALR WGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60 

LFAFLEHRRM RRAGQALKLP SPRRGSVALC ITAYQEDPDY LRKCLRSAQR ISFPDLKWM 120 

WDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDWRA 180 

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQIIjNK YDSWISFIiSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESWTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420 

IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIWNFIG 480 

LIPVSIWVAV LLEGDAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540 
KKPEQYSLAF ABV 

Seq ID NO: 175 DNA sequence 
Nucleic Acid Accession #: NM_000691 
Coding sequence: 43-1404



1 11 21 31 41 51 

111111 

CCAGGAGCCC CAGTTACCGG GAGAGGCTGT GTCAAAGGCG CCATGAGCAA GATCAGCGAG 60 

GCCGTGAAGC GCGCCCGCGC CGCCTTCAGC TCGGGCAGGA CCCGTCCGCT GCAGTTCCGA 120 

TTCCAGCAGC TGGAGGCGCT GCAGCGCCTG ATCCAGGAGC AGGAGCAGGA GCTGGTGGGC 180 

GCX3CTGGCCG CAGACCTGCA CAAGAAT6AA TGGAACGCCT ACTATGAGGA GGTGGTGTAC 240 

GTCCTAGAGG AGATCGAGTA CATGATCCAG AAGCTCCCTG A6TGGGCCGC GGATGAGCCC 300 

GTGGAGAAGA CGCCCCAGAC TCAGCAGGAC GAGCTCTACA TCCACTCGGA GCCACTGGGC 360 

GTGGTCCTCG TCATTGGCAC CTGGAACTAC CCCTTCAACC TCACCATCCA GCCCATGGTG 420 

GGCGCCATCG CTGCAGGGAA CGCAGTGGTC CTCAAGCCCT CGGAGCTGAG TGAGAACATG 480 

GCX5AGCCTGC TGGCTACCAT CATCCCCCAG TACCTGGACA AGGATCTGTA CCCAGTAATC 540 

AATGGGGGTG TCCCTGAGAC CACGGAGCTG CTCAAGGAGA GGTTCGACCA TATCCTGTAC 600 

ACGGGCAGCA CGGGGGTGGG GAAGATCATC ATGACGGCTG CTGCCAAGCA CCTGACCCCT 660 

GTCACGCTGG AGCTGGGAGG GAAGAGTCCC TGCTACGTGG ACAAGAACTG TGACCTGGAC 720 

GTGGCCTGCC GACGCATCGC CTGGGGGAAA TTCATGAACA GTGGCCAGAC CTGCGTGGCC 780 

CCAGACTACA TCCTCTGTGA CCCCTCGATC CAGAACCAAA TTGTGGAGAA GCTCAAGAAG 840 

TCACTGAAAG AGTTCTACGG GGAAGATGCT AAGAAATCCC GGGACTATGG AAGAATCATT 900 

AGTGCCCGGC ACTTCCAGAG GGTGATGGGC CTGATTGAGG GCCAGAAGGT GGCTTATGGG 960 

GGCACCGGGG ATGCCGCCAC TCGCTACATA GCCCCCACCA TCCTCACGGA CGTGGACCCC 1020 
CAGTCCCCGG TGATGCAAGA GGAGATCTTC GGGCCTGTGC TGCCCATCGT GTGCGTGCGC 1080

AGCCTGGAGG AGGCCATCCA GTTCATCAAC CAGCGTGAGA AGCCCCTGGC CCTCTACATG 1140 

TTCTCCAGCA ACGACAAGGT GATTAAGAA6 ATGATTGCAG AGACATCCAG TGGTGGGGTG 1200 

GCGGCCAACG ATGTCATCGT CCACATCACC TTGCACTCTC TGCCCTTCGG GGGCGTGGGG 1260 

AACAGCGGCA TGGGATCCTA CCATGGCAAG AAGAGCTTCG AGACTTTCTC TCACCGCCGC 1320 

TCTTGCCTGG TGAGGCCTCT GATGAATGAT GAAGGCCTGA AGGTCAGATA CCCCCCGAGC 1380 

CCGGCCAAGA TGACCCAGCA CTGAG6AGGG GTTGCTCCGC CTGGCCTGGC CATACTGTGT 1440 

CCCATCGGAG TGCGGACCAC CCTCACTGGC TCTCCTGGCC CTGGAGAATC GCTCCTGCAG 1500 

CCCCAGCCCA GCCCCACTCC TCTGCTGACC TGCTGACCTG TGCACACCCC ACTCCCACAT 1560 

GGGCCCAGGC CTCACCATTC CAAGTCTCCA CCCCTTTCTA GACCAATAAA GAGACT^AATA 1620 
CAATTTTCTA ACTCGG 

Seq ID NO: 176 Protein sequence: 
Protein Accession #: NP_000682 

1 11 21 31 41 51 

1 I I I I I 

MSKISEAVKR ARAAFSSGRT RPLQFRFQQL EALQRLIQEQ EQELVGALAA DliHKNEWNAY 60 

YEEWYVLEE lEYMIQKLPE WAADEPVEKT PQTQQDELYI HSEPLGWLV IGTWNYPFNL 120 

TIQPMVGAIA AGNAWLKPS ELSENMASLIi ATIIPQYLDK DLYPVINGGV PETTELLKER 180 

FDHILYTGST GVGKIIMTAA AKHLTPVTLE LGGKSPCYVD KNCDLDVACR RIAWGKFMNS 240

GQTCVAPDYI LCDPSIQNQI VEKLKKSLKE FYGEDAKKSR DYGRIISARH FQRVMGLIEG 300

QKVAYGGTGD AATRYIAPTI LTDVDPQSPV MQEEIFGPVL PIVCVRSLEE AIQFINQREK 360

PLALYMFSSN DKVIKKMIAE TSSGGVAAND VIVHITLHSL PFGGVGNSGM GSYHGKKSFE 420 
TFSHRRSCLV RPLMNDEGLK VRYPPSPAKM TQH 

Seq ID NO: 177 DNA sequence

Nucleic Acid Accession #: NM_001067.1 

Coding sequence: 108-4703 

1 11 21 31 41 51 

I I I I i I 

CTAACCGACG CGCGTCTGTG GAGAAGCGGC TTGGTCGGGG GTGGTCTCGT GGGGTCCTGC 60 

CTGTTTAGTC GCTTTCAGGG TTCTTGAGCC CCTTCACGAC CGTCACCATG GAAGTGTCAC 120 

CATTGCAGCC TGTAAATGAA AATATGCAAG TCAACAA?AT AAAGAAAAAT GAAGATGCTA 180 

AGAAAAGACT GTCTGTTGAA AGAATCTATC AAAAGAAAAC ACAATTGGAA CATATTTTGC 240 

TCCGCCCAGA CACCTACATT GGTTCTGTGG AATTAGTGAC CCAGCAAATG TGGGTTTACG 300 

ATGAAGATGT TGGCATTAAC TATAGGGAAG TCACTTTTGT TCCTG6TTTG TACAAAATCT 360 

TTGATGAGAT TCTAGTTAAT GCTGCGGACA ACA7VACAAAG GGACCCAAT^ ATGTCTTGTA 420 

TTAGAGTCAC AATTGATCCG GAAAACAATT TAATTAGTAT ATGGAATAAT GGAAAAGGTA 480 

TTCCTGTTGT TGAACACAAA GTTGAAAAGA TGTATGTCCC AGCTCTCATA TTTGGACAGC 540 

TCCTJ\ACTTC TAGTAACTAT GATGATGATG AAAAGAAAGT GACAGGTGGT CGAAATGGCT 600 

ATGGAGCCAA ATTGTGTAAC ATATTCAGTA CCAAATTTAC TGTGGAAACA GCCAGTA6AG 660 

AATACAAGAA AATGTTCAAA CAGACATGGA TGGATAATAT GGGAAGAGCT GGTGAGATGG 720 

AACTCAAGCC CTTCAATGGA GAAGATTATA CATGTATCAC CTTTCAGCCT GATTTGTCTA 780 

AGTTTAAAAT GCAAAGCCTG GACAAAGATA TTGTTGCACT AATGGTCAGA AGAGCATATG 840 

ATATTGCTGG ATCCACCAAA GATGTCAAAG TCTTTCTTAA TGGAAATAAA CTGCCAGTAA 900

AAGGATTTCG TAGTTATGTG GACATGTATT TGAAGGACAA GTTGGATGAA ACTGGTAACT 960 

CCTTGAAAGT AATACATGAA CAAGTAAACC ACAGGTGGGA AGTGTGTTTA ACTATGAGTG 1020

AAAAAGGCTT TCAGCAAATT AGCTTTGTCA ACAGCATTGC TACATCCAAG GGTGGCA6AC 1080 

ATGTTGATTA TGTAGCTGAT CAGATTGTGA CTAAACTTGT TGATGTTGTG AAGAAGTUVGA 1140 

ACAAGGGTGG TGTTGCAGTA AAAGCACATC AGGTGAAAAA TCACATGTGG ATTTTTGTAA 1200

ATGCCTTAAT TGAAAACCCA ACCTTTGACT CTCAGACAAA AGAAAACATG ACTTTACAAC 1260

CCAAGAGCTT TGGATCAACA TGCCAATTGA GTGAAAAATT TATCAAAGCT GCCATTGGCT 1320

GTGGTATTGT AGAAAGCATA CTAAACTGGG TGAAGTTTAA GGCCCAAGTC CAGTTAAACA 1380 

AGAAGTGTTC AGCTGTAAAA CATAATAGAA TCAAGGGAAT TCCCAAACTC GATGATGCCA 1440 

ATGATGCAGG GGGCCX3AAAC TCCACTGAGT GTACGCTTAT CCTGACTGAG GGAGATTCAG 1500 

CCAAAACTTT GGCTGTTTCA GGCCTTGGTG TGGTTGGGAG AGACAAATAT GGGGTTTTCC 1560 

CTCTTAGAGG AAAAATACTC AATGTTCGAG AAGCTTCTCA TAAGCAGATC ATGGAAAATG 1620 

CTGAGATTAA CAATATCATC AAGATTGTGG GTCTTCAGTA CAAGAAAAAC TATGAAGATG 1680 

AAGATTCATT GAAGACGCTT CGTTATGGGA AGATAATGAT TATGACAGAT CAGGACCAAG 1740 

ATGGTTCCCA CATCAAAGGC TTGCTGATTA ATTTTATCCA TCACAACTGG CCCTCTCTTC 1800 

TGCGACATCG TTTTCTGGAG GAATTTATCA CTCCCATTGT AAAGGTATCT AAAAACAAGC 1860 

AAGAAATGGC ATTTTACAGC CTTCCTGAAT TTGAAGAGTG GAAGAGTTCT ACTCCAAATC 1920 

ATAAA^J^TG GAAAGTCAAA TATTACAAAG GTTTGGGCAC CAGCACATCA AAGGAAGCTA 1980 

AAGAATACTT TGCAGATATG AAAAGACATC GTATCCAGTT CAAATATTCT GGTCCTGAAG 2040 

ATGATGCTGC TATCAGCCTG GCCTTTAGCA AAA/UICAGAT A6ATGATCGA AAGGAATGGT 2100 

TAACTAATTT CATGGAGGAT AGAAGACAAC GAAAGTTACT TGGGCTTCCT GAGGATTACT 2160 

TGTATGGACA AACTACCACA TATCTGACAT ATAATGACTT CATCAACAAG GAACTTATCT 2220 

TGTTCTCAAA TTCTGATAAC GAGAGATCTA TCCCTTCTAT GGTGGATGGT TTGAAACCAG 2280 

GTCAGAGAAA GGTTTTGTTT ACTTGCTTCA AACGGAATGA CAAGCGAGAA GTAAAGGTTG 2340

CCCAATTAGC TGGATCAGTG GCTGAAATGT CTTCTTATCA TCATGGTGAG ATGTCACTAA 2400

TGATGACCAT TATCT^TTTG GCTCAGAATT TTGTGGGTAG CAATAATCTA AACCTCTTGC 2460 

AGCCCATTGG TCAGTTTGGT ACCAGGCTAC ATGGTGGCAA GGATTCTGCT AGTCCACGAT 2520 

ACATCTTTAC 7U\TGCTCAGC TCTTTGGCTC GATTGTTATT TCCACCAAAA GATGATCACA 2580 

CGTTGAAGTT TTTATATGAT GACAACCAGC GTGTTGAGCC TGAATGGTAC ATTCCTATTA 2640 

TTCCCATGGT GCTGATAAAT GGTGCTGTU^G GAATCGGTAC TGGGTGGTCC TGCAAAATCC 2700

CCAACTTTGA TGTGCGTGAA ATTGTAAATA ACATCAGGCG TTTGATGGAT GGAGAAGAAC 2760 

CTTTGCCAAT GCTTCCAAGT TACAAGAACT TCAAGGGTAC TATTGAAGAA CTGGCTCCAA 2820 

ATCAATATGT GATTAGTGGT GAAGTAGCTA TTCTTAATTC TACAACCATT GAAATCTCAG 2880 

AGCTTCCCGT CAGAACATGG ACCCAGACAT ACAAAGAACA AGTTCTAGAA CCCATGTTGA 2940

ATGGCACCX3A GTVAGACACCT CCTCTCATAA CAGACTATAG GGAATACCAT ACAGATACCA 3000 

CTGTGAAATT TGTTGTGAAG ATGACTGAAG AAAAACTGGC AGAGGCAGAG AGAGTTGGAC 3060 

TACACAAAGT CTTCAAACTC CAAACTAGTC TCACATGCAA CTCTATGGTG CTTTTTGACC 3120 

ACGTAGGCTG TTTAAAGAAA TATGACACGG TGTTGGATAT TCTAAGAGAC TTTTTTGAAC 3180 

TCAGACTTAA ATATTATGGA TTAAGAAAAG AATGGCTCCT AGGAATGCTT GGTGCTGAAT 3240 

CTGCTAAACT GAATAATCAG GCTCGCTTTA TCTTAGAG7UV AATAGATGGC AAAATAATCA 3300 
TTGAAAATAA GCCTAAGAAA GAATTAATTA AAGTTCTGAT TCAGAGGGGA TATGATTCGG 3360

ATCCTGTGAA GGCCTGGAAA GAAGCCCAGC AAAAGGTTCC AGATGAAGAA GAAAATGAAG 3420 

AGAGTGACAA CGATU^GGAA ACTGAAAAGA GTGACTCCGT AACAGATTCT GGACCAACCT 3480 

TCAACTATCT TCTTGATATG CCCCTTTGGT ATTTAACCAA GGAAAAGAAA GATGAACTCT 3540 

GCAGGCTAAG AAATGAAAAA GAACAAGAGC TGGACACATT AAAAAGAAAG AGTCCATCAG 3600

ATTTGTGGAA AGAAGACTTG GCTACATTTA TTGAAGAATT GGAGGCTGTT GAAGCCAAGG 3660 

AAAAACAA6A T6AACAAGTC G6ACTTCCTG GGAAAGGGGG GAAGGCCAAG GGGAAAAAAA 3720 

CACAAATGGC TGAAGTTTTG CCTTCTCCGC GTGGTC7VAAG AGTCATTCCA CGAATAACCA 3780 

TAGAAATGAA AGCAGAGGCA GJ\AAAGAAAA ATAAAAAGAA AATTAAGAAT GAAAATACTG 3840

AAGGAAGCCC TCAAGAAGAT GGTGTGGAAC TAGAAGGCCT AAAACAAAGA TTAGAAAAGA 3900

AACAGAAAAG AGAACCAGGT ACAAAGACAA AGAAACAAAC TACATTGGCA TTTAAGCCAA 3960

TCAAAAAAGG AAAGAAGAGA AATCCCTGGC CTGATTCAGA ATCAGATAGG AGCAGTGACG 4020 

AAAGTAATTT TGATGTCCCT CCACGAGAAA CAGAGCCACG GAGAGCAGCA ACAAAAACAA 4080 

AATTCACAAT G6ATTTGGAT TCAGATGAAG ATTTCTCAGA TTTTGATGAA AAAACTGATG 4140 

ATGAAGATTT TGTCCCATCA GATGCTAGTC CACCTAAGAC CAAAACTTCC CCAAAACTTA 4200

GTAACAAAGA ACTGAAACCA CAGAAAAGTG TCGTGTCAGA CCTTGAAGCT GATGATGTTA 4260 

AGGGCAGTGT ACCACTGTCT TCAAGCCCTC CTGCTACACA TTTCCCAGAT GAAACTGAAA 4320 

TTACAAACCC AGTTCCTAAA AAGAATGTGA CAGTGAAGAA GAGAGCAGCA AAAAGTCAGT 4380 

CTTCCACCTC CACTACCGGT GGGAAAAAAA GGGCTGCCCC AAAAGGAACT AAAAGGGATC 4440 

CAGCTTTGAA TTCTGGTGTC TCTCAAAAGC CTGATCCTGC CTW^CCAAG AATCGCCGCA 4500

AAAGGAAGCC ATCCACTTCT GATGATTCTG ACTCTAATTT TGAGAAAATT GTTTCGAAAG 4560 

CAGTCACAAG CAAGAAATCC AAGGGGGAGA GTGATGACTT CCATATGGAC TTTGACTCAG 4620 

CTGTGGCTCC TCGGGCAAAA TCTGTACGGG CAAAGAAACC TATAAAGTAC CTGGAAGAGT 4680 

CAGATGAAGA TGATCTGTTT TAAAATGTGA GGCGATTATT TTAAGTAATT ATCTTACCAA 4740

GCCCAAGACT GGTTTTAAAG TTACCTGAAG CTCTTAACTT CCTCCCCTCT GAATTTAGTT 4800

TGGGGAAGGT GTTTTTAGTA CAAGACATCA AAGTG/ikAGTA AAGCCCAAGT GTTCTTTAGC 4860 

TTTTTATAAT ACTGTCTAAA TAGTGACCAT CTCATGGGCA TTGTTTTCTT CTCTGCTTTG 4920 

TCTGTGTTTT GAGTCTGCTT TCTTTTGTCT TTAAAACCTG ATTTTTAAGT TCTTCTGAAC 4980 

TGTAGAAATA GCTATCTGAT CACTTCAGCG TAAAGCAGTG TGTTTATTAA CCATCCACTA 5040 

AGCTAAAACT AGAGCAGTTT GATTTAAAAG TGTCACTCTT CCTCCTTTTC TACTTTCAGT 5100

AGATATGAGA TAGAGCATTW^ TTATCTGTTT TATCTTAGTT TTATACATAA TTTACCATCA 5160 

GATAGAACTT TATGGTTCTA GTACAGATAC TCTACTACAC TCAGCCTCTT ATGTGCCAAG 5220 

TTTTTCTTTA AGCAATGAGA AATTGCTCAT GTTCTTCATC TTCTCAAATC ATCAGAGGCC 5280 

AAAGAAAAAC ACTTTGGCTG TGTCTATAAC TTGACACAGT CAATAGAATG AAGAA?U^TTA 5340 

GAGTAGTTAT GTGATTATTT CAGCTCTTGA CCTGTCCCCT CTGGCTGCCT CTGAGTCTGA 5400

ATCTCCCAAA GAGAGAAACC AATTTCTAAG AGGACTGGAT TGCAGAAGAC TCGGGGACAA 5460 

CATTTGATCC AAGATCTTAA ATGTTATATT GATAACCATG CTCAGCAATG AGCTATTAGA 5520 

TTCATTTTGG GAAATCTCCA TAATTTCAAT TTGTAAACTT TGTTAAGACC TGTCTACATT 5580 

GTTATATGTG TGTGACTTGA GTAATGTTAT CAACGTTTTT GTAAATATTT ACTATGTTTT 5640 

TCTATTAGCT AAATTCCAAC AATTTTGTAC TTTAATAAAA TGTTCTAAAC ATTGC

Seq ID NO: 178 Protein sequence:
Protein Accession #: NP_001058.1






MEVSPLQPVN 
MWVYDEDVGI 
NGKGIPWEH 
TASREYKKMF 
RRAYDIAGST 
LTMSEKGFQQ 
WIFVNALIEN 
VQLNKKCSAV 
YGVFPIjRGKI 
DQDQDGSHIK 
STPNHKKWKV 
RKEWLTNFME 
GLKPGQRKVIi 
LNLLQPIGQF 
YIPIIPMVLI 
ELAPNQYVIS 
HTDTTVKFW 
DFFELRLKYY 
GYDSDPVKAW 
KDELCRIjRNE 
KGKKTQMAEV 
RLEKKQKREP 
ATKTKFTMDL 
ADDVKGSVPIi 
TKRDPAIjNSG 
DFDSAVAPRA 



11 
I 

ENMQVNKIKK 
NYREVTFVPG 
KVEKMYVPAIi 
KQTWMDNMGR 
KDVKVFLNGN 
ISFVNSIATS 
PTFDSQTKEN 
KHNRIKGIPK 
LNVREASHKQ 
GIjLINFIHHN 
KYYKGLGTST 
DRRQRKLLGL 
FTCFKRNDKR 
GTRLHGGKDS 
NGAEGIGTGW 
GEVAIIiNSTT 
KMTEEKLAEA 
GLRKEWLLGM 
KEAQQKVPDE 
KEQELDTLKR 
LPSPRGQRVI 
GTKTKKQTTL 
DSDEDFSDFD 
SSSPPATHFP 
VSQKPDPAKT 
KSVRAKKPIK 



21 

1 

NEDAKKRLSV 
LYKIFDEILV 
IFGQLLTSSN 
AGEMELKPFN 
KLPVKGFRSY 
KGGRHVDYVA 
MTLQPKSFGS 
LDDANDAGGR 
IMENAEINNI 
WPSLLRllRFIi 
SKEAKEYFAD 
PEDYLYGQTT 
EVKVAQLAGS 
ASPRYIFTML 
SCKIPNFDVR 
lEISELPVRT 
ERVGLHKVFK 
LGAESAKL^3N 
EENEESDNEK 
KSPSDLWKED 
PRXTIEMKAE 
AFKPIKKGKK 
EKTDDEDFVP 
DETEITNPVP 
KNRRKRKPST 
YLEESDEDDIj 



31 

I ' 

ERIYQKKTQIj 
NAADNKQRDP 
YDDDEKKVTG 
GEDYTCITFQ 
VDMYLKDKLD 
DQIVTKLVDV 
TCQLSEKFIK 
NSTECTIjILT 

ikivglqykk 
eefitpivkv 

MKRHRIQFKY 
TYLTYNDFIN 
VAEMSSYHHG 
SSLARLLFPP 
EIVNNIRRLM 
WTQTYKEQVL 
LQTSLTCNSM 
QARFILEKID 
ETEKSDSVTD 
LATFIEELEA 
AEKKNKKKIK 
RNPWPDSESD 
SDASPPKTKT 
KKNVTVKKTA 
SDDSDSNFEK 
F 



41 

I 

EHILLRPDTY 
KMSCIRVTID 
GRNGYGAKLC 
PDLSKFKMQS 
ETGNSIiKVIH 
VKKKNKGGVA 
AAIGCGIVES 
EGDSAKTLAV 
NYEDEDSLKT 
SKNKQBMAFY 
SGPEDDAAIS 
KELILFSNSD 
EMSLMMTIIN 
KDDHTLKFLY 
DGEEPIiPMIiP 
EPMLNGTEKT 
VLFDHVGCLK 
GKI I lENKPK 
SGPTFNYLLD 
VEAKEKQDEQ 
NENTBGSPQE 
RSSDESNFDV 
SPKLSNKELK 
AKSQSSTSTT 
IVSKAVTSKK 



51 
I 

IGSVELVTQQ 
PENNLISIWN 
NIFSTKFTVE 
LDKDIVALMV 
EQVNHRWEVC 
VKAHQVKNHM 
ILNWVKFKAQ 
SGLGWGRDK 
LRYGKIMIMT 



LAFSKKQIDD 
NERSIPSMVD 
LAQNFVGSNN 
DDNQRVEPEW 
SYKNFKGTIE 
PPLITDYREY 
KYDTVLDILR 
KELIKVLIQR 
MPLWYLTKEK 
VGLPGKGGKA 
DGVELEGLKQ 
PPRETEPRRA 
PQKSWSDLE 
GAKKRAAPKG 
SKGESDDFHM 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



Seq ID NO: 179 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 148-7095 



1 11 21 31 41 51 
| | | | | | 
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCGACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATAGTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 180 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | | 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 
LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 
ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 
KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 
SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 
PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 
LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 
KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 
AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 
TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 
EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 
PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 
KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 
SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 
ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 
HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR 1680 
VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 
TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 
QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 
VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 
AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 
QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040 
SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 
HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 
NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 
VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO: 181 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 148-4518 



PCT/US02/12476 



1 11 21 31 41 51 

| | | | | | 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 
AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 182 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 
| | | | | | 
MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 
PLVIVSALTF ICLVVLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 
PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 
VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 
VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 
GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR 1080 
TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 
VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 
VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV 1260 
MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320 
VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 
MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 

Seq ID NO: 183 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 
| | | | | | 
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 184 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

| | | | | | 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 
LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 
IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 
KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 
NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 
GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVVVHCSAGV GRTGTYIVLD 1080 
SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 
YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 
SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260 
AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 
CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 
SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 
AESLESLV 



Seq ID NO: 185 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480
TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660
GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720
ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780
TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840
ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900
CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960
CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080
GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200
AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260
GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320
GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380
ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500
ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620
AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680
CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980
AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220
CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460
AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320
ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380
ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440
CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500
AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620
ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680
TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740
ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800
TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860
CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920
AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980
GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040
TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100
TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160
TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220
AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 
ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 

AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 186 Protein sequence: 
Protein Accession #: EOS sequence 



PCT/US02/12476 



1 11 21 31 41 51

I I I I I I

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60
LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120
PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 180
SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRVV YDTMIEKFAV LYQQLDGEDQ 240
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF 300
PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360
TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 420
NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 480
ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 540
DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600
FPTEVTPHAF TPSSRQQDLV STVNVVYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660
IPLVIVSALT FICLVVLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720
IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780
IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840
NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900
KGSQKGRPSG RVVTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VVVHCSAGVG 960
RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET 1020
EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080
PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 1140
VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 1200
YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260
LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320
GAALPDGNIA ESLESLV




Seq ID NO: 187 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4632 



1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460
AAAAAAAAAA AAAAAAAAAA A

Seq ID NO: 188 Protein sequence:
Protein Accession #: EOS sequence

1 11 21 31 41 51

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720
TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780
LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840
IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900
AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960
EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020
SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT 1080
GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140
LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200
SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260
YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP 1320
INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380
FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV

Seq ID NO: 189 DNA sequence
Nucleic Acid Accession #: NM_002820
Coding sequence: 304..831

1 11 21 31 41 51

I I I I I I

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60
CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120
CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180

WO 02/086443 

TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG 240 

GTTTGGAGAA AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG 300 

ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT GAGCTACGCG 360 

GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG AGCTGTGTCT 420 

GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC 480 

CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC 540 

CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT 600 

GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660 

AAGACACCTG GGAAGAAAAA GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA GGAAAAGAAA 720

AAACGGCGAA CTCGCTCTGC CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG 780 

GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT 840 

CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG 900 

GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA 960 

CAGAGAATAA CTCAGAATAT TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC 1020

TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA 1080 

CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC 1140 

ATCTTCATAA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT 1200 

TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT 1260 

GATATTATCT ACAAACACTG CAGAACAGCA TCATGTCATA AACGATTCTG AGCCATTCAC 1320 

ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT 1380 

TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA 1440 

CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA 1500

GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560 
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA






Seq ID NO: 190 Protein sequence:
Protein Accession #: NP_002811
MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 
TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq ID NO: 191 DNA sequence 
Nucleic Acid Accession #: XM_059328 
Coding sequence: 52..1023



1 11 21 31 41 51

I I I I I I

GGGCTGTCCG GCCCACTCCC CTGGGAGCGC GAGCGGTGGA CCCAGGCGGC CATGTCCCGC
CCTCGCATGC GCCTGGTGGT CACCGCGGAC GACTTTGGTT ACTGCCCGCG ACGCGATGAG
GGTATCGTGG AGGCCTTTCT GGCCGGGGCT GTGACCAGCG TGTCCCTGCT GGTCAACGGT
GCGGCCACGG AGAGCGCGGC GGAGCTGGCC CGCAGGCACA GCATCCCCAC GGGCCTCCAC
GCCAACCTGT CCGAGGGCCG CCCCGTGGGT CCGGCCCGCC GTGGCGCCTC ATCGCTGCTC
GGCCCGGAAG GCTTCTTCCT TGGCAAGATG GGATTCCGGG AGGCGGTGGC GGCCGGAGAC
GTGGATTTGC CTCAGGTGCG GGAGGAGCTC GAGGCCCAAC TAAGCTGCTT CCGGGAGCTG
CTGGGCAGGG CCCCCACGCA CGCGGACGGG CACCAGCACG TGCACGTGCT CCCAGGCGTG
TGCCAGGTGT TCGCCGAGGC GCTGCAGGCC TATGGGGTGC GCTTTACGCG ACTGCCGCTG
GAGCGCGGTG TGGGTGGCTG CACTTGGCTG GAGGCCCCCG CGCGTGCCTT CGCCTGCGCC
GTGGAGCGCG ACGCCCGGGC CGCGGTGGGC CCCTTCTCCC GCCACGGCCT GCGGTGGACA
GACGCCTTCG TGGGCCTGAG CACTTGCGGC CGGCACATGT CCGCTCACCG CGTGTCCGGG
GCCCTGGCGC GGGTCCTGGA AGGTACCCTA GCGGGCCACA CCCTGACAGC CGAGCTGATG
GCGCACCCCG GCTACCCCAG TGTGCCTCCC ACCGGCGGCT GCGGTGAAGG CCCCGACGCT
TTCTCTTGCT CTTGGGAGCG GCTGCATGAG CTGCGCGTCC TCACCGCGCC CACGCTGCGG
GCCCAGCTTG CCCAGGATGG CGTGCAGCTT TGCGCCCTCG ACGACCTGGA CTCCAAGAGG
CCAGGGGAGG AGGTCCCCTG TGAGCCCACT CTGGAACCCT TCCTGGAACC CTCCCTACTC
TGACCCCCTA CAGACAACCA AGCACTAATC CCCTTAGTAC CAAGAAAGGG GAGCCAGGAT
TTAGTCCTGG CCCAGCCCAG AGCTGGGACC TGGAGCACGA TCTGTTGACT TCCCTGGGTA
GGACACTGCC ACCTCTGGGC TCAGGTCCTC ATGCCTCCAA ATGGCATCTA GAGTTTGAGC
AGCCTTCTTG GCTGCAGGCA GGCCTAGCCT GTGGCAGCGG GCTAGGGCCC GCAGAGCATT
TGGTGCCCCT CCATGTTGCA ATGCAAACAC CTTCACCACT GGGGCAGTGG GGAGAGATGG
CTATATTAAT AAAATAACGT GTGTCTTTC

Seq ID NO: 192 Protein sequence:
Protein Accession #: XP_059328



1 11 21 31 41 51

I I I I I I

MSRPRMRLVV TADDFGYCPR RDEGIVEAFL AGAVTSVSLL VNGAATESAA ELARRHSIPT
GLHANLSEGR PVGPARRGAS SLLGPEGFFL GKMGFREAVA AGDVDLPQVR EELEAQLSCF
RELLGRAPTH ADGHQHVHVL PGVCQVFAEA LQAYGVRFTR LPLERGVGGC TWLEAPARAF
ACAVERDARA AVGPFSRHGL RWTDAFVGLS TCGRHMSAHR VSGALARVLE GTLAGHTLTA
ELMAHPGYPS VPPTGGCGEG PDAFSCSWER LHELRVLTAP TLRAQLAQDG VQLCALDDLD
SKRPGEEVPC EPTLEPFLEP SLL



Seq ID NO: 193 DNA sequence 

Nucleic Acid Accession #: NM_005688.1 

Coding sequence: 126..4439



1 11 21 31 41 51

I I I I I I

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660
TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720
TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780
TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840
CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900
TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960
TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020
GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080
GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140
TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200
ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260
AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320
AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380
CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440
TCTTCAATTC CATGACTTTT GCTTTGAAAG Ti^ACACCGTT TTCAGTAAAG TCCCTCTCAG 1500
AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560
TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620
GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCC/iT^ ATGAAAAAAG 1680
ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740
AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800
CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860
ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920
GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980
TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040
TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100
ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160
GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220
TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280
TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340
TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400
GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460
CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520
AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700
TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760
GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820
CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880
CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000
CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120
TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180
GGCCCCTTGT CATCCTGTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360
AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420
CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480
TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600
CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780
CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840
CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080
TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140
GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380
GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620
ATTCCATATT CATGTA/^CA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680
GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740
TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800
TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920
CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 194 Protein sequences 
Protein Accession #: NP_005679.1 



1 11 21 31 41 51 

i I 1 1 I ^ 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQIiR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGEIiINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAPSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VtAEOKGHIiti LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDbEIQEGKL VGICGSVGSG 600 

KTSIilSAILG QMTIiLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCIiRPDIAII* PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 72 0 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPBEGQI. VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV PPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRBLK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQIiT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKl LILDBATAAM DTETDLLIQE TIREAFADCT 1380 
MliTIAHRLHT VLGSDRIMVL AQGQWBFDT PSVMiSNDSS RFYAMFAAAB NKVAVKG 

Seq ID NO: 195 DNA sequence 
Nucleic Acid Accession #: NM_006470
Coding sequence: 228..1922



1 11 21 31 41 51

| | | | | |

GCTGTCCTGA GCCTGAGTAC TCTAGCTGCC TTGTCGCCAT CGCATCTGGC TGCCATCCAG 60 

CGCCAGCACA CAGTAATGAG TGGCCGAGCT TCCTCTGGGA GGGAGGAAAC AGTTAAAATC 120

TTGCAGCAGC TGCAATCATC TAGGCGTGGT TCTCTTGTCT GACTTGGGCT GCACAGATCC 180

TGGGCCAAGG GACAGAAGAA AGACAGCCTA GGAGCAGAGC CTCCCAGATG GCTGAGTTGG 240 

ATCTAATGGC TCCAGGGCCA CTGCCCAGGG CCACTGCTCA GCCCCCAGCC CCTCTCAGCC 300 

CAGACTCTGG GTCACCCAGC CCAGATTCTG GGTCAGCCAG CCCAGTGGAA GAAGAGGACG 360 

TGGGCTCCTC GGAGAAGCTT GGCAGGGAGA CGGAGGAACA GGACAGCGAC TCTGCAGAGC 420 

AGGGGGATCC TGCTGGTGAG GGGAAAGAGG TCCTGTGTGA CTTCTGCCTT GATGACACCA 480 

GAAGAGTGAA GGCAGTGAAG TCCTGTCTAA CCTGCATGGT GAATTACTGT GAAGAGCACT 540 

TGCAGCCGCA TCAGGTGAAC ATCAAACTGC AAAGCCACCT GCTGACCGAG CCAGTGAAGG 600 

ACCACAACTG GCGATACTGC CCTGCCCACC ACAGCCCACT GTCTGCTTTC TGCTGCCCTG 660 

ATCAGCAGTG CATCTGCCAG GACTGTTGCC AGGAGCACAG TGGCCACACC ATAGTCTCCC 720 

TGGATGCAGC CCGCAGGGAC AAGGAGGCTG AACTCCAGTG CACCCAGTTA GACTTGGAGC 780 

GGAAACTCAA GTTGAATGAA AATGCCATCT CCAGGCTCCA GGCTAACCAA AAGTCTGTTC 840 

TGGTGTCGGT GTCAGAGGTC AAAGCGGTGG CTGAAATGCA GTTTGGGGAA CTCCTTGCTG 900 

CTGTGAGGAA GGCCCAGGCC AATGTGATGC TCTTCTTAGA GGAGAAGGAG CAAGCTGCGC 960 

TGAGCCAGGC CAACGGTATC AAGGCCCACC TGGAGTACAG GAGTGCCGAG ATGGAGAAGA 1020

GCAAGCAGGA GCTGGAGAGG ATGGCGGCCA TCAGCAACAC TGTCCAGTTC TTGGAGGAGT 1080 

ACTGCAAGTT TAAGAACACT GAAGACATCA CCTTCCCTAG TGTTTACGTA GGGCTGAAGG 1140 

ATAAACTCTC GGGCATCCGC AAAGTTATCA CGGAATCCAC TGTACACTTA ATCCAGTTGC 1200 

TGGAGAACTA TAAGAAAAAG CTCCAGGAGT TTTCCAAGGA AGAGGAGTAT GACATCAGAA 1260 

CTCAAGTGTC TGCCGTTGTT CAGCGCAAAT ATTGGACTTC CAAACCTGAG CCCAGCACCA 1320 

GGGAACAGTT CCTCCAATAT GCGTATGACA TCACGTTTGA CCCGGACACA GCACACAAGT 1380 

ATCTCCGGCT GCAGGAGGAG AACCGCAAGG TCAGCAACAC CACGCCCTGG GAGCATCCCT 1440 

ACCCGGACCT CCCCAGCAGG TTCCTGCACT GGCGGCAGGT GCTGTCCCAG CAGAGTCTGT 1500 

ACCTGCACAG GTACTATTTT GAGGTGGAGA TCTTCGGGGC AGGCACCTAT GTTGGCCTGA 1560 

CCTGCAAAGG CATCGACCGG AAAGGGGAGG AGCGCAACAG TTGCATTTCC GGAAACAACT 1620 

TCTCCTGGAG CCTCCAATGG AACGGGAAGG AGTTCACGGC CTGGTACAGT GACATGGAGA 1680 

CCCCACTCAA AGCTGGCCCT TTCCGGAGGC TCGGGGTCTA TATCGACTTC CCGGGAGGGA 1740 

TCCTTTCCTT CTATGGCGTA GAGTATGATA CCATGACTCT GGTTCACAAG TTTGCCTGCA 1800

AATTTTCAGA ACCAGTCTAT GCTGCCTTCT GGCTTTCCAA GAAGGAAAAC GCCATCCGGA 1860 

TTGTAGATCT GGGAGAGGAA CCCGAGAAGC CAGCACCGTC CTTGGGGGTG ACTGCTCCCT 1920 

AGACTCCAGG AGCCATATCC CAGACCTTTG CCAGCTACAG TGATGGGATT TGCATTTTAG 1980 

GGTGATTTGT GGGCAGAAAT 7UVCTGCTGAT GGTAGCTGGC TTTTGAAATC CTATGGGGTC 2040 

TCTGAATGAA AACATTCTCC AGCTGCTCTC TTTTGCTCCA TATGGTGCTG TTCTCTATGT 2100 

GTTTGCAGTA ATTCTTTTTT TTTTTTTTGA GACGGAGTCT CGCACTGTTG CCCAGGCTGG 2160 

AGAGCAGTGG CGCGATCTTG GCTCACTGCA AGCTCCGCCT CCCGAGTTCA AGCAATTCTC 2220 

CTGCCTCAGC CTCCCGAGTA GCTGGGATTA CAGGTGCCTG CCACCACACC CAGCTAATGT 2280 

TTTGTATTTT TAGTAGAGAT GGGGTTTCAC CATGTTGGCC AGGCAGATCT CAAACTCCTG 2340 



ACCTCGTGAT GCACCCACCT CGGCCTCCCA AAGTGCTGGG ATTACATGCG TGAGCCT^CTG 2400
CGCCCTGCCT GTTTGTAGTA ATTTTTAGGC ACCAAATCTC CCTCATCTTC TAGTGCCATT 2460 
CTCCTCTCTG TTCAGGTAAA TGTCACACTG TGCCCAGAAT GGATGACCAG GAACCTTAAA 2520 
GAGTGGCTGA AAAGATTGCA 6A6TTATCAT AATAT^TTGC TAACTTGCGT 



PCT/US02/12476 



Seq ID NO: 196 Protein sequence: 
Protein Accession #: NP_006461 



1 11 21 31 41 51
| | | | | |
MAELDLMAPG PLPRATAQPP APLSPDSGSP SPDSGSASPV EEEDVGSSEK LGRETEEQDS 60
DSAEQGDPAG EGKEVLCDFC LDDTRRVKAV KSCLTCMVNY CEEHLQPHQV NIKLQSHLLT 120
EPVKDHNWRY CFAHHSPLSA FCCPDQQCIC QDCCQEHSGH TIVSLDAARR DKEAELQCTQ 180
LDLERKLKLN ENAISRLQAN QKSVLVSVSE VKAVAEMQFG ELLAAVRKAQ ANVMLFLEEK 240
EQAALSQANG IKAHLEYRSA EMEKSKQELE RMAAISNTVQ FLEEYCKFKN TEDITFPSVY 300
VGLKDKLSGI RKVITESTVH LIQLLENYKK KLQEFSKEEE YDIRTQVSAV VQRKYWTSKP 360
EPSTREQFLQ YAYDITFDPD TAHKYLRLQE ENRKVTNTTP WEHPYPDLPS RFLHWRQVLS 420
QQSLYLHRYY FEVEIFGAGT YVGLTCKGID RKGEERNSCI SGNNFSWSLQ WNGKEFTAWY 480
SDMETPLKAG PFRRLGVYID FPGGILSFYG VEYDTMTLVH KFACKFSEPV YAAFWLSKKE 540
NAIRIVDLGE EPEKPAPSLG VTAP



Seq ID NO: 197 DNA sequence 
Nucleic Acid Accession #: NM_004316 
Coding sequence: 433-1149

1 11 21 31 41 51
| | | | | |
CCCGAGACCC GGCGCAAGAG AGCGCAGCCT TAGTAGGAGA GGAACGCGAG ACGCGGCAGA 60
GCGCGTTCAG CACTGACTTT TGCTGCTGCT TCTGCTTTTT TTTTTCTTAG AAACAAGAAG 120
GCGCCAGCGG CAGCCTCACA CGCGAGCGCC ACGCGAGGCT CCCGAAGCCA ACCCGCGAAG 180
GGAGGAGGGG AGGGAGGAGG AGGCGGCGTG CAGGGAGGAG AAAAAGCATT TTCACCTTTT 240
TTGCTCCCAC TCTAAGAAGT CTCCCGGGGA TTTTGTATAT ATTTTTTAAC TTCCGTCAGG 300
GCTCCCGCTT CATATTTCCT TTTCTTTCCC TCTCTGTTCC TGCACCCAAG TTCTCTCTGT 360
GTCCCCCTCG CGGGCCCCGC ACCTCGCGTC CCGGATCGCT CTGATTCCGC GACTCCTTGG 420
CCGCCGCTGC GCATGGAAAG CTCTGCCAAG ATGGAGAGCG GCGGCGCCGG CCAGCAGCCC 480
CAGCCGCAGC CCCAGCAGCG CTTCCTGCCG CCCGCAGCCT GTTTCTTTGC CACGGCCGCA 540
GCCGCGGCGG CCGCAGCCGC CGCAGCGGCA GCGCAGAGCG CGCAGCAGCA GCAGCAGCAG 600
CAGCAGCAGC AGCAGCAGCA GCAGGCGCCG CAGCTGAGAC CGGCGGCCGA CGGCCAGCCC 660
TCAGGGGGCG GTCACAAGTC AGCGCCCAAG CAAGTCAAGC GACAGCGCTC GTCTTCGCCC 720
GAACTGATGC GCTGCAAACG CCGGCTCAAC TTCAGCGGCT TTGGCTACAG CCTGCCGCAG 780
CAGCAGCCGG CCGCCGTGGC GCGCCGCAAC GAGCGCGAGC GCAACCGCGT CAAGTTGGTC 840
AACCTGGGCT TTGCCACCCT TCGGGAGCAC GTCCCCAACG GCGCGGCCAA CAAGAAGATG 900
AGTAAGGTGG AGACACTGCG CTCGGCGGTC GAGTACATCC GCGCGCTGCA GCAGCTGCTG 960
GACGAGCATG ACGCGGTGAG CGCCGCCTTC CAGGCAGGCG TCGTGTCGCC CACCATCTCC 1020
CCCAACTACT CCAACGACTT GAACTCCATG GCCGGCTCGC CGGTCTCATC CTACTCGTCG 1080
GACGAGGGCT CTTACGACCC GCTCAGCCCC GAGGAGCAGG AGCTTCTCGA CTTCACCAAC 1140
TGGTTCTGAG GGGCTCGGCC TGGTCAGGCC CTGGTGCGAA TGGACTTTGG AAGCAGGGTG 1200
ATCGCACAAC CTGCATCTTT AGTGCTTTCT TGTCAGTGGC GTTGGGAGGG GGAGAAAAGG 1260
AAAAGAAAAA AAAAGAAGAA GAAGAAGAAA AGAGAAGAAG AAAAAAACGA AAACAGTCAA 1320
CCAACCCCAT CGCCAACTAA GCGAGGCATG CCTGAGAGAC ATGGCTTTCA GAAAACGGGA 1380
AGCGCTCAGA ACAGTATCTT TGCACTCCAA TCATTCACGG AGATATGAAG AGCAACTGGG 1440
ACCTGAGTCA ATGCGCAAAA TGCAGCTTGT GTGCAAAAGC AGTGGGCTCC TGGCAGAAGG 1500
GAGCAGCACA CGCGTTATAG TAACTCCCAT CACCTCTAAC ACGCACAGCT GAAAGTTCTT 1560
GCTCGGGTCC CTTCACCTCC CCGCCCTTTC TTAGAGTGCA GTTCTTAGCC CTCTAGAAAC 1620
GAGTTGGTGT CTTTC



Seq ID NO: 198 Protein sequence:
Protein Accession #: NP_004307

1 11 21 31 41 51
| | | | | |
MESSAKMESG GAGQQPQPQP QQPFLPPAAC FFATAAAAAA AAAAAAAQSA QQQQQQQQQQ 60
QQQQAPQLRP AADGQPSGGG HKSAPKQVKR QRSSSPELMR CKRRLNFSGF GYSLPQQQPA 120
AVARRNERER NRVKLVNLGF ATLREHVPNG AANKKMSKVE TLRSAVEYIR ALQQLLDEHD 180
AVSAAFQAGV LSPTISPNYS NDLNSMAGSP VSSYSSDEGS YDPLSPEEQE LLDFTNWF



Seq ID NO: 199 DNA sequence 
Nucleic Acid Accession #: NM_007015
Coding sequence: 1-1005

1 11 21 31 41 51
| | | | | |
ATGACAGAGA ACTCCGACAA AGTTCCCATT GCCCTGGTGG GACCTGATGA CGTGGAATTC 60
TGCAGCCCCC CGGCGTACGC TACGCTGACG GTGAAGCCCT CCAGCCCCGC GCGGCTGCTC 120
AAGGTGGGAG CCGTGGTCCT CATTTCGGGA GCTGTGCTGC TGCTCTTTGG GGCCATCGGG 180
GCCTTCTACT TCTGGAAGGG GAGCGACAGT CACATTTACA ATGTCCATTA CACCATGAGT 240
ATCAATGGGA AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 300
TTTAAAATGG GAAGTGGAGC TGAAGAAGCA ATTGCAGTTA ATGATTTCCA GAATGGCATC 360
ACAGGAATTC GTTTTGCTGG AGGAGAGAAG TGCTACATTA AAGCGCAAGT GAAGGCTCGT 420
ATTCCTGAGG TGGGCGCCGT GACCAAACAG AGCATCTCCT CCAAACTGGA AGGCAAGATC 480
ATGCCAGTCA AATATGAAGA AAATTCTCTT ATCTGGGTGG CTGTAGATCA GCCTGTGAAG 540
GACAACAGCT TCTTGAGTTC TAAGGTGTTA GAACTCTGCG GTGACCTTCC TATTTTCTGG 600
CTTAAACCAA CCTATCCAAA AGAAATCCAG AGGGAAAGAA GAGAAGTGGT AAGAAAAATT 660
GTTCCAACTA CCACAAAAAG ACCACACAGT GGACCACGGA GCAACCCAGG CGCTGGAAGA 720
CTGAATAATG AAACCAGACC CAGTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 780



CCTTATCATC AGCAGGAAGG GGAAAGCATG ACATTCGACC CTAGACTGGA TCACGAAGGA 840

ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960

ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020

CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080 

ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 

TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 1260 

CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 1320 
AGTTTGCC 



Seq ID NO: 200 Protein sequence: 
Protein Accession #: NP_008946 



1 11 21 31 41 51 

| | | | | |

MTENSDKVPI ALVGPDDVEF CSPPAYATLT VKPSSPARLL KVGAVVLISG AVLLLFGAIG 60
AFYFWKGSDS HIYNVHYTMS INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 120
TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 180
DNSFLSSKVL ELCGDLPIFW LKPTYPKEIQ RERREVVRKI VPTTTKRPHS GPRSNPGAGR 240
LNNETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 300
GGYYPWPYNY QGCRSACRVI MPCSWWVARI LGMV



Seq ID NO: 201 DNA sequence 

Nucleic Acid Accession #: NM_000728.2 

Coding sequence: 112..495

1 11 21 31 41 51 

GTAATAAGAG CGGGGTCTCC GCGGGGAAGG CGCCCACAGC AGGTGTGGTG TTCATCCCGG 60 

GTCGACCGGC CGCTCGCGCT GCCCTGAAAC TCTAGTCGCC AGAGAGGCGG CATGGGTTTC 120 

CGGAAGTTCT CCCCCTTCCT GGCTCTCAGT ATCTTGGTCC TGTACCAGGC GGGCAGCCTC 180 

CAGGCGGCGC CATTCAGGTC TGCCCTGGAG AGCAGCCCAG ACCCGGCCAC ACTCAGTAAA 240 

GAGGACGCGC GCCTCCTGCT GGCTGCACTG GTGCAGGACT ATGTGCAGAT GAAGGCCAGT 300 

GAGCTGAAGC AGGAGCAGGA GACACAGGGC TCCAGCTCCG CTGCCCAGAA GAGAGCCTGC 360 

AACACTGCCA CCTGTGTGAC TCATCGGCTG GCAGGCTTGC TGAGCAGATC AGGGGGCATG 420

GTGAAGAGCA ACTTCGTGCC CACCAATGTG GGTTCCAAAG CCTTTGGCAG GCGCCGCAGG 480

GACCTTCAAG CCTGAGCAGA TGAATGACTC CAGGAAGAAG GTGTGTCCTA AATCCAATGA 540 

CATATCCTTA TAAGAGATTC ACTCAGAAGA CACATGTGGA GAAGGTGACA TGACAGAGGC 600 

AAGGAGGCAC AAGCCAAGGA AGTCTGTGTC TACCAGAAGC CAGAATCACA GAACAGTCTG 660 

TGGAAGAAGA GCAGCCCTGC TGACACCTAG AGTTTGGACT TCCAGCTTCC AGAACTGTGA 720 

GAGAATAATT TCTGTTGTTT TAAGCCACAA AGTTTGTGGT AATTTGTTAT GACAGCCCTA 780 

GGAAACTAAT ACAATACATT TTCATTTATT TTGGGTAAAT GCCTTGGAGT GGGATTGCTG 840 

GGTTATTTGG AAAGTGTGTA TTTAACTCTG TAAGAAACTG CCAAACTATT TTCTGAAGTG 900 

ACTGTACCAC TTCGCCTTCT TGCCAGCCAC ATATOAGAGC TCTAGTATTT CCACAAATAG 960 

GTATGTAGCA GTATCTCATT GCTGTTTTAA TTTGTATTTC CCCAATGACT AATGACGTTG 1020 

AGCATCTATT TTACCATATG TTTATCACCT TTATTGAAGG GTCTGTTTAA ATCTTCTGCT 1080 

j^^^^^^Q^ TGGCTTGCTT GCTTTATTAG TGTTGAGTTT TTAGAGCTCT TTATATGTTG 1140 

TGGATGCAAG ATTGTTTTCA GATATATAGT TTGGAAACTT CCTTCCCCTG AATCTGCGGA 1200 

TTGCTTTTTC ATTTTCTTAG CAGTGTCTCT CACAGAGAAA AAGTTGTAAT TTGAATAAGA 1260 

TCCAATTCAT CTTTTTTTTT CTTTTATGTA TTGTGCTTTT AGTTCATGTC TAAGAACTCT 1320 

TTGCCTAACT AAGGTCCCAA GGTCACAATA ACCTTATTCT ATACTTTCTT GTAAAAGTTT 1380 

TATAGTTTTA TATTTTATAT GTAGATTAGT GATCTATTTT GAGTTAATTT TTGTATAAGG 1440 

TGAGAGGTGT AGGTTGAAAT TCATACCTGT GAATATAGAT ACCCAATTGT TTCAGTGCCA 1500 

TTTGTTAAAA AGACTGTTAT TTCACCATTT AATTGCCCCT GCACCTTTGT CAT^AAAGCAA 1560 

CTGATCATAT TTGTGTGGGT ATATTTCTGG GTTCTCAATT CTGTCTCATT GATTGATTTG 1620 

ACCATTCTTT TGCCAATGTC ATACTGCCTT GATTAGTGTA GTGTTAAAGT GAATCTCAAA 1680 

ACCAGATAAT GTGGGTCTAC CAACATTGTT CATTCTTGTT CAAAAAGATT TTAGCTACAT 1740 

CTAAAATATT TTCTACATCT TTTATACATT TTAGAATCAG TGTGTTACTA TCTACAAAAT 1800 

TTCTGATGAG ATTTTTAATG GGATTGTGTT AAATCAGTGG GTTAATTTTG GGAGAATTAG 1860 

CATATTAATA ATATTAAGTC GTTCAATTCA TGAACACAAT ACATGTTTTC ACTTATTTAG 1920 

GTTTTCTCTG TTTTTTTTTT TTTAACAGTG TTCTCAGTTT TCAACAGAAA TATTCTACAC 1980

ATATCTTGTT AGATTTTTAA CTATTTTATT TTTTGGTGCT AATGTAAATG GTACTTAAAC 2040 

ATTTTTGTTT TTAATTGTTC ATTGCTAGTA GATAGAAATA CAATATTTAA AATATTAGGA 2100 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA



Seq ID NO: 202 Protein sequence: 
Protein Accession #: NP_000719.1

1 11 21 31 41 51

| | | | | |

MGFRKFSPFL ALSILVLYQA GSLQAAPFRS ALESSPDPAT LSKEDARLLL AALVQDYVQM 60 
KASELKQEQE TQGSSSAAQK RACNTATCVT HRLAGLLSRS GGMVKSNFVP TNVGSKAFGR 120 

RRRDLQA 



Seq ID NO: 203 DNA sequence 
Nucleic Acid Accession #: NM_001741 
Coding sequence: 71..496

1 11 21 31 41 51

| | | | | |

CTCTGGCTGG ACGCCGCCGC CGCCGCTGCC ACCGCCTCTG ATCCAAGCCA CCTCCCGCCA 60
GAGAGGTGTC ATGGGCTTCC AAAAGTTCTC CCCCTTCCTG GCTCTCAGCA TCTTGGTCCT 120
GTTGCAGGCA GGCAGCCTCC ATGCAGCACC ATTCAGGTCT GCCCTGGAGA GCAGCCCAGC 180
AGACCCGGCC ACGCTCAGTG AGGACGAAGC GCGCCTCCTG CTGGCTGCAC TGGTGCAGGA 240
CTATGTGCAG ATGAAGGCCA GTGAGCTGGA GCAGGAGCAA GAGAGAGAGG GCTCCAGCCT 300






wo 02/086443 

GGACAGCCCC AGATCTAAGC GGTGCGGTAA TCTGAGTACT TGCATGCTGG GCACATACAC 360

GCAGGACTTC AACAAGTTTC ACACGTTCCC CCAAACTGCA ATTGGGGTTG GAGCACCTGG 420 

AAAGAAAAGG GATATGTCCA GCGACTTGGA GAGAGACCAT CGCCCTCATG TTAGCATGCC 480 

CCAGAATGCC AACTAAACTC CTCCCTTTCC TTCCTAATTT CCCTTCTTGC ATCCTTCCTA 540 

TAACTTGATG CATGTGGTTT GGTTCCTCTC TGGTGGCTCT TTGGGCTGGT ATTGGTGGCT 600 

TTCCTTGTGG CAGAGGATGT CTCAAACTTC AGATGGGAGG AAAGAGAGCA GGACTCACAG 660 

GTTGGAAGAG AATCACCTGG GAAAATACCA GAAAATGAGG GCCGCTTTGA GTCCCCCAGA 720 

GATGTCATCA GAGCTCCTCT GTCCTGCTTC TGAATGTGCT GATCATTTGA GGAATAAAAT 780 
TATTTTTCCC C 






Seq ID NO: 204 Protein sequence: 
Protein Accession #: NP_001732 

1 11 21 31 41 51 

| | | | | |

MGFQKFSPFL ALSILVLLQA GSLHAAPFRS ALESSPADPA TLSEDEARLL LAALVQDYVQ 60
MKASELEQEQ EREGSSLDSP RSKRCGNLST CMLGTYTQDF NKFHTFPQTA IGVGAPGKKR 120
DMSSDLERDH RPHVSMPQNA N

Seq ID NO: 205 DNA sequence 
Nucleic Acid Accession #: NM_005361
Coding sequence: 1-945

1 11 21 31 41 51
| | | | | |
ATGCCTCTTG AGCAGAGGAG TCAGCACTGC AAGCCTGAAG AAGGCCTTGA GGCCCGAGGA 60
GAGGCCCTGG GCCTGGTGGG TGCGCAGGCT CCTGCTACTG AGGAGCAGCA GACCGCTTCT 120
TCCTCTTCTA CTCTAGTGGA AGTTACCCTG GGGGAGGTGC CTGCTGCCGA CTCACCGAGT 180
CCTCCCCACA GTCCTCAGGG AGCCTCCAGC TTCTCGACTA CCATCAACTA CACTCTTTGG 240
AGACAATCCG ATGAGGGCTC CAGCAACCAA GAAGAGGAGG GGCCAAGAAT GTTTCCCGAC 300
CTGGAGTCCG AGTTCCAAGC AGCAATCAGT AGGAAGATGG TTGAGTTGGT TCATTTTCTG 360
CTCCTCAAGT ATCGAGCCAG GGAGCCGGTC ACAAAGGCAG AAATGCTGGA GAGTGTCCTC 420
AGAAATTGCC AGGACTTCTT TCCCGTGATC TTCAGCAAAG CCTCCGAGTA CTTGCAGCTG 480
GTCTTTGGCA TCGAGGTGGT GGAAGTGGTC CCCATCAGCC ACTTGTACAT CCTTGTCACC 540
TGCCTGGGCC TCTCCTACGA TGGCCTGCTG GGCGACAATC AGGTCATGCC CAAGACAGGC 600
CTCCTGATAA TCGTCCTGGC CATAATCGCA ATAGAGGGCG ACTGTGCCCC TGAGGAGAAA 660
ATCTGGGAGG AGCTGAGTAT GTTGGAGGTG TTTGAGGGGA GGGAGGACAG TGTCTTCGCA 720
CATCCCAGGA AGCTGCTCAT GCAAGATCTG GTGCAGGAAA ACTACCTGGA GTACCGGCAG 780
GTGCCCGGCA GTGATCCTGC ATGCTACGAG TTCCTGTGGG GTCCAAGGGC CCTCATTGAA 840
ACCAGCTATG TGAAAGTCCT GCACCATACA CTAAAGATCG GTGGAGAACC TCACATTTCC 900
TACCCACCCC TGCATGAACG GGCTTTGAGA GAGGGAGAAG AGTGA



Seq ID NO: 206 Protein sequence: 
Protein Accession #: NP_005352

1 11 21 31 41 51
| | | | | |
MPLEQRSQHC KPEEGLEARG EALGLVGAQA PATEEQQTAS SSSTLVEVTL GEVPAADSPS 60
PPHSPQGASS FSTTINYTLW RQSDEGSSNQ EEEGPRMFPD LESEFQAAIS RKMVELVHFL 120
LLKYRAREPV TKAEMLESVL RNCQDFFPVI FSKASEYLQL VFGIEVVEVV PISHLYILVT 180
CLGLSYDGLL GDNQVMPKTG LLIIVIAIIA IEGDCAPEEK IWEELSMLEV FEGREDSVFA 240
HPRKLLMQDL VQENYLEYRQ VPGSDPACYE FLWGPRALIE TSYVKVLHHT LKIGGEPHIS 300
YPPLHERALR EGEE

Seq ID NO: 207 DNA sequence 
Nucleic Acid Accession #: NM_021115 
Coding sequence: 743-2893

1 11 21 31 41 51
| | | | | |
AAAGGAAGGG AGGGAGGGAG AAAGGAGAAG TTGGTTTAGA GGCCAGCCGG ACGAGCTTTG 60
GGCACCGCCC TTAGGAGGGC CACCCTCAGA GTCTGACAGC AGGTGAAGGT CCTAAATCTC 120
CCCAAACTAA CTGGTGTCTT TTCTCCTCTT CCAAGATGCT CTTCCCGAGG GAGATGCTAG 180
CCCTTTGGGT CCTTACCTCC TGCCCTCAGG AGCCCCGGAG AGAGGCAGTC CTGGCAAAGA 240
GCACCCTGAA GAGAGAGTGG TAACAGCGCC CCCCAGTTCC TCACAGTCGG CGGAAGTGCT 300
GGGCGAGCTG GTGCTGGATG GGACCGCACC CTCTGCACAT CACGACATCC CAGCCCTGTC 360
ACCGCTGCTT CCAGAGGAGG CCCGCCCCAA GCACGCCTTG CCCCCCAAGA AGAAACTGCC 420
TTCGCTCAAG CAGGTGAACT CTGCCAGGAA GCAGCTGAGG CCCAAGGCCA CCTCCGCAGC 480
CACTGTCCAA AGGGCAGGGT CCCAGCCAGC GTCCCAGGGC CTAGATCTCC TCTCCTCCTC 540
CACGGAGAAG GCTGGCCCAC CGGGGGACCC GGACCCCATC GTGGCCTCCG AGGAGGCATC 600
AGAAGTGCCC CTTTGGCTGG ACCGAAAGGA GAGTGCGGTC CCTACAACAC CCGCACCCCT 660
GCAAATCTCC CCCTTCACTT CGCAGCCCTA TGTGGCCCAC ACACTCCCCC AGAGGCCAGA 720
ACCCGGGGAG CCTGGGCCTG ACATGGCCCA GGAGGCCCCC CAGGAGGACA CCAGCCCCAT 780
GGCCCTGATG GACAAAGGTG AGAATGAGCT GACTGGGTCA GCCTCAGAGG AGAGCCAGGA 840
GACCACTACC TCCACCATTA TCACCACCAC GGTCATCACC ACCGAGCAGG CACCAGCTCT 900
CTGCAGTGTG AGCTTCTCCA ATCCTGAGGG GTACATTGAC TCCAGCGACT ACCCACTGCT 960
GCCCCTCAAC AACTTTCTGG AGTGCACATA CAACGTGACA GTCTACACTG GCTATGGGGT 1020
GGAGCTCCAG GTGAAGAGTG TGAACCTGTC CGATGGGGAA CTGCTCTCCA TCCGCGGGGT 1080
GGACGGCCCT ACCCTGACCG TCCTGGCCAA CCAGACACTC CTGGTGGAGG GGCAGGTAAT 1140
CCGAAGCCCC ACCAACACCA TCTCCGTCTA CTTCCGGACC TTCCAGGACG ACGGCCTTGG 1200
GACCTTCCAG CTTCACTACC AGGCCTTCAT GCTGAGCTGC AACTTTCCCC GCCGGCCTGA 1260
CTCTGGGGAT GTCACGGTGA TGGACCTGCA CTCAGGTGGG GTGGCCCACT TTCACTGCCA 1320
CCTGGGCTAT GAGCTCCAGG GCGCTAAGAT GCTGACATGC ATCAATGCCT CCAAGCCGCA 1380
CTGGAGCAGC CAGGAGCCCA TCTGCTCAGC TCCTTGTGGA GGGGCAGTGC ACAATGCCAC 1440
CATCGGCCGC GTCCTCTCCC CAAGTTACCC TGAAAACACA AATGGGAGCC AATTCTGCAT 1500
CTGGACGATT GAAGCTCCAG AGGGCCAGAA GCTGCACCTG CACTTTGAGA GGCTGTTGCT 1560


GCATGACAAG GACAGGATGA CGGTTCACAG CGGGCAGACC AACAAGTCAG CTCTTCTCTA 1620
CGACTCCCTT CAAACCGAGA GTGTCCCTTT TGAGGGCCTG CTGAGCGAAG GCAACACCAT 1680
CCGCATCGAG TTCACGTCCG ACCAGGCCCG GGCGGCCTCC ACCTTCAACA TCCGATTTGA 1740
AGCGTTTGAG AAAGGCCACT GCTATGAGCC CTACATCCAG AATGGGAACT TCACTACATC 1800
CGACCCGACC TATAACATTG GGACTATAGT GGAGTTCACC TGCGACCCCG GCCACTCCCT 1860
GGAGCAGGGC CCGGCCATCA TCGAATGCAT CAATGTGCGG GACCCATACT GGAATGACAC 1920
AGAGCCCCTG TGCAGAGCCA TGTGTGGTGG GGAGCTCTCT GCTGTGGCTG GGGTGGTATT 1980
GTCCCCAAAC TGGCCCGAGC CCTACGTGGA AGGTGAAGAT TGTATCTGGA AGATCCACGT 2040
GGGAGAAGAG AAACGGATCT TCTTAGATAT CCAGTTCCTG AATCTGAGCA ACAGTGACAT 2100
CTTGACCATC TACGATGGCG ACGAGGTCAT GCCCCACATC TTGGGGCAGT ACCTTGGGAA 2160
CAGTGGCCCC CAGAAACTGT ACTCCTCCAC GCCAGACTTA ACCATCCAGT TCCATTCGGA 2220
CCCTGCTGGC CTCATCTTTG GAAAGGGCCA GGGATTTATC ATGAACTACA TAGAGGTATC 2280
AAGGAATGAC TCCTGCTCGG ATTTACCCGA GATCCAGAAT GGCTGGAAAA CCACTTCTCA 2340
CACGGAGTTG GTGCGGGGAG CCAGAATCAC CTACCAGTGT GACCCCGGCT ATGACATCGT 2400
GGGGAGTGAC ACCCTCACCT GCCAGTGGGA CCTCAGCTGG AGCAGCGACC CCCCATTTTG 2460
TGAGAAAATT ATGTACTGCA CCGACCCCGG AGAGGTGGAT CACTCGACCC GCTTAATTTC 2520
GGATCCTGTG CTGCTGGTGG GGACCACCAT CCAATACACC TGCAACCCCG GTTTTGTGCT 2580
TGAAGGGAGT TCTCTTCTGA CCTGCTACAG CCGTGAAACA GGGACTCCCA TCTGGACGTC 2640
TCGCCTGCCC CACTGCGTTT CAGAAGCGGC AGCAGAGACG TCGCTGGAAG GGGGGAACAT 2700
GGCCCTGGCT ATCTTCATCC CGGTCCTCAT CATCTCCTTA CTGCTGGGAG GAGCCTACAT 2760
TTACATCACA AGATGTCGCT ACTATTCCAA CCTCCGCCTG CCTCTGATGT ACTCCCACCC 2820
CTACAGCCAG ATCACCGTGG AAACCGAGTT TGACAACCCC ATTTACGAGA CAGGGGGAAC 2880
CCAAAAGGTT TAGGGTTTCA TTTAAAAAGA GGTACCCTTT AAAAAGGGGC TTGTGAACTC 2940
AACCCCAATT TCCCCGAGAC ATTTATCCAA AGGCCCTGGG GGCCTTGATT TAAACCCCCA 3000
AAAGGCGGCT GTTTTTTGGT TAAACTTTTT AACAAAGGGT TACGGGTTTT TTCCCCGGAT 3060
TTTATAAATT TTAAAAGTG






Seq ID NO: 208 Protein sequence: 
Protein Accession #: NP_066938 



1 11 21 31 41 51
| | | | | |
MAQEAPQEDT SPMALMDKGE NELTGSASEE SQETTTSTII TTTVITTEQA PALCSVSFSN 60
PEGYIDSSDY PLLPLNNFLE CTYNVTVYTG YGVELQVKSV NLSDGELLSI RGVDGPTLTV 120
LANQTLLVEG QVIRSPTNTI SVYFRTFQDD GLGTFQLHYQ AFMLSCNFPR RPDSGDVTVM 180
DLHSGGVAHF HCHLGYELQG AKMLTCINAS KPHWSSQEPI CSAPCGGAVH NATIGRVLSP 240
SYPENTNGSQ FCIWTIEAPE GQKLHLHFER LLLHDKDRMT VHSGQTNKSA LLYDSLQTES 300
VPFEGLLSEG NTIRIEFTSD QARAASTFNI RFEAFEKGHC YEPYIQNGNF TTSDPTYNIG 360
TIVEFTCDPG HSLEQGPAII ECINVRDPYW NDTEPLCRAM CGGELSAVAG VVLSPNWPEP 420
YVEGEDCIWK IHVGEEKRIF LDIQFLNLSN SDILTIYDGD EVMPHILGQY LGNSGPQKLY 480
SSTPDLTIQF HSDPAGLIFG KGQGFIMNYI EVSRNDSCSD LPEIQNGWKT TSHTELVRGA 540
RITYQCDPGY DIVGSDTLTC QWDLSWSSDP PFCEKIMYCT DPGEVDHSTR LISDPVLLVG 600
TTIQYTCNPG FVLEGSSLLT CYSRETGTPI WTSRLPHCVS EAAAETSLEG GNMALAIFIP 660
VLIISLLLGG AYIYITRCRY YSNLRLPLMY SHPYSQITVE TEFDNPIYET GGTQKV



Seq ID NO: 209 DNA sequence
Nucleic Acid Accession #: NM_001327.1
Coding sequence: 89-631

1 11 21 31 41 51
| | | | | |
AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60
CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120
GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180
TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240
AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300
GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360
CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420
GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480
CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540
CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600
GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660
GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA



Seq ID NO: 210 Protein sequence: 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51 

| | | | | |

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR



Seq ID NO: 211 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 52-459

1 11 21 31 41 51
| | | | | |
CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO: 212 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 

PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
FLPVFLAQAP SGQRR 

Seq ID NO: 213 DNA sequence 
Nucleic Acid Accession #: NM_000555
Coding sequence: 416..1498

1 11 21 31 41 51

| | | | | |

CTTATTTTTT ATGAATGTCG GATAGCTGCA CCAGCTTGGT GGGGAAAGGG TTTGATGAAT 60 

AGCACAAAGA CACTGGCTGT TCCCTGGAGG CTGTCCCTTT AAAGGAGAAT CTTAGTTTAT 120 

TCTGGGGGGA GGGGATGCAC ACATTAGAGT AGGAAAGAGG GCTTGGAATA AAATGATVAAC 180 

ACTCCCCCTT CATAGTCATT GTACTGAAAT GCAAAGACTG CTTCCTAAGC TGGAGATGCT 240 

AACCTTGGGT AGCTCCTTCT GTTCTCTTCA AGGGGAATTT TGTCAGGCTA TGGATTCATT 300 

TACAACTGTT AGTCATGTGG GCATGTGTGA GGAAACAGAT GCCAGTTTTA ATGTATTTAG 360 

CCCGAAGTTC CAATTTGATA GGAGCCACTG TCAGTCTCTG AGGTTCCACC AAAATATGGA 420

ACTTGATTTT GGACACTTTG ACGAAAGAGA TAAGACATCC AGGAACATGC GAGGCTCCCG 480 

GATGAATGGG TTGCCTAGCC CCACTCACAG CGCCCACTGT AGCTTCTACC GAACCAGAAC 540 

CTTGCAGGCA CTGAGTAATG AGAAGAAAGC CAAGAAGGTA CGTTTCTACC GCAATGGGGA 600

CCGCTACTTC AAGGGGATTG TGTACGCTGT GTCCTCTGAC CGTTTTCGCA GCTTTGACGC 660 

CTTGGTGGCT GACCTGACGC GATCTCTGTC TGACAACATC AACCTGCCTC AGGGAGTGCG 720

TTACATTTAC ACCATTGATG GATCCAGGAA GATCGGAAGC ATGGATGAAC TGGAGGAAGG 780 

GGAAAGCTAT GTCTGTTCCT CAGACAACTT CTTTAAAAAG GTGGAGTACA CCAAGAATGT 840 

CAATCCCAAC TGGTCTGTCA ACGTAAAAAC ATCTGCCAAT ATGAAAGCCC CCCAGTCCTT 900 

GGCTAGCAGC AACAGTGCAC AGGCCAGGGA GAACAAGGAC TTTGTGCGCC CCAAGCTGGT 960 

TACCATCATC CGCAGTGGGG TGAAGCCTCG GAAGGCTGTG CGTGTGCTTC TGAACAAGAA 1020

GACAGCCCAC TCTTTTGAGC AAGTCCTCAC TGATATCACA GAAGCCATCA AACTGGAGAC 1080 

CGGGGTTGTC AAAAAACTCT ACACTCTGGA TGGAAAACAG GTAACTTGTC TCCATGATTT 1140 

CTTTGGTGAT GATGATGTGT TTATTGCCTG TGGTCCTGAA AAATTTCGCT ATGCTCAGGA 1200 

TGATTTTTCT CTGGATGAAA ATGAATGCCG AGTCATGAAG GGAAACCCAT CAGCCACAGC 1260

TGGCCCAAAG GCATCCCCAA CACCTCAGAA GACTTCAGCC AAGAGCCCTG GTCCTATGCG 1320 

CCGAAGCAAG TCTCCAGCTG ACTCAGCAAA CGGAACCTCC AGCAGCCAGC TCTCTACCCC 1380

CAAGTCTAAG CAGTCTCCCA TCTCTACGCC CACCAGTCCT GGCAGCCTCC GGAAGCACAA 1440

GGACCTGTAC CTGCCTCTGT CCTTGGATGA CTCGGACTCG CTTGGTGATT CCATGTAAAG 1500 

GAGGGGAGAG TGCTCAGAGT CCAGAGTACA AATCCAAGCC TATCATTGTA GTAGGGTACT 1560 

TCTGCTCAAG TGTCCAACAG GGCTATTGGT GCTTTCAAGT TTTTATTTTG TTGTTGTTGT 1620 

TATTTTGAAA AACACATTGT AATATGTTGG GTTTATTTTC CTGTGATTTC TCCTCTGGGC 1680 

CACTGATCCA CAGTTACCAA TTATGAGAGA TAGATTGATA ACCATCCTTT GGGGCAGCAT 1740 

TCCAGGGATG CAAAATGTGC TAGTCCATGA CCTTTCAATG GAAAGCTTAG GGGCCTGGGG 1800 

TAAATTTGCC CCGTTTAAAT TTGCCCAAAC AGTTTTCCTT TTGTAGAGGG GTGTTTAAAT 1860 

ATACAGCAAT TAAAAAGTTT GTGTGGGGAA AAAAAAAACT CATTGGCAGA TCCAAGAATG 1920

ACAAACACAA GTGCCCCTTT TCTGTGGATC TCAAGAATGG TGGAGGACCC TGGAAGGACA 1980

GCAAGGCAGC TCCCCAGCCT CACTCTTCAC TCCTGATTGA GGCCCGGGTT TGTTGTCCAG 2040 

CACCAATTCT GGCTGTCAAT GGGGAGAAAT AAACCAACAA CTTATAATTG TGACACCAGA 2100 

TGCTTAGGAT CCTGGTGCTG GGTTAGCTAA GAGAATAGAC AGAATTGGAA AATACTGCAG 2160 

ACATTTCCGA AGAGTTTATA AAGCACAGTG AATTCCTGGT CAATCTCTCC ACTGAGGCAA 2220 

TTTGGAATCA ATAAGCAATT GATAATAGTT TGGAGTAAGG 6ACTTCATAT ACCTGATTCX: 2280 

TCTAGAAGGC TGTCTAACAT ACCACATGAT TACATGAACT GTATGGTATC CATCTATCTC 2340 

TGTTCTATTG AATGCCTTGT TAACAGCCAA CACTGAAAAC ACTGTGAGAA TTTGTTTTCA 2400

GGTCTGACAC CTTTCAGTCT CTTTTTATAG CAAGAAATCA ATATCCTTTT TATAAAAATT 2460 

CATGTCTGTA TTTCAGGAGC AAACTCTTCA GGCTCCTTTT TTATAAACTG GTGATTTTTC 2520 

TTTTGTCTAA AAAACACATG AAGAAAATTT ACCAGAAAAA AAAAAAAAAG CCGAAGAATA 2580 

ATGTTATTTA GAAATTATGC TGTCACTGCC AAACAGTAAC CTCCAGGAGA AAACAAGATG 2640 

AATAGCAGAG GCCAATTCAA TAGAATCAGT TTTTTGATAG CTTTTTAACA GTTATGCTTG 2700 

CATTAATAAT TTCAATGTGG ACCAGACATT CTAATTATAT TTTAAATGAA ATGTTACAGC 2760 

ATATTTTAAG CAACTCTTTT TATCTATAAT CCTAATATTT CATACTGAAG ACACAGAAAT 2820 

CTTTCACTTG TCTTTAACAT TAGAAAGGAT TTCTCTTTAC TAAGGACTGA TCATTTGAAA 2880 

TAGTTTTCAG TCTTTTGAGA TACAGGTTTA TAACACTGCT TTTTTTTTCC TGTAAACATA 2940

GCCCATAATG GCAAAAACAA CTAATTTTAA TTGAAGGTCT TGCTTGCCAN TCCTGTGTTG 3000 

GCTTTNACCA AATATAAAAA TTCCCTTATT CCTTGGTAAT GGTGCAAATN TTTGGAAAGG 3060 

CACAGCATCC AAACCAAGCT GCTGTTTGGC TACTGAATGG CTTGCAGTTG TTCCTCCACT 3120 

CTAAATGGAA TGAGCTTGCT GTGTGTGTGT GTGGTGGTGG TGGGAGGGGG TGGTGCATGT 3180 

GTGTGTGTGT GTGTGCATCT GCAGCTGGTT CAAAATTAAG AAATACTACA AGACACCCCT 3240 

GTAATGGATT GGTGGCAACT GGGTGGCACT GCTGATGTGC ACTGTGTAGG GGGGAACCCA 3300

GTGGTGGTGG GGTATCTCAA ATGCCCCTAG ACAAGCTTCA GATGTCTGTA GCTACCAAAA 3360

ACATTTTCX3G TTCAAGAAAA GTGAGATGAT GGTAGTACTG GTTTCTGGTG ATU^TTGAAAA 3420 

ACCCCAAATG ATGAGGATCT CTTTTTGCCC CCTCTCCTTT TTTTGTA7\AC CCATTCAAAA 3480 

CCATTAATAA GCCCATTTTA CTAANCCCCT ATTTCTTTCT AGAAGCTCAG GGTTTNCTTA 3540 

GTGCCTCCCA NAACATTTTG TAGTTAATTG GGAAAAAGTG ATACTTGGAT TAGGGGGTGT 3600

GGGCATAAAG AATGGTGGGA GGCCTGATTT TAAAATTCAG GCCAGAACCC CCAATGACTC 3660 

CACCCATAGT NTCACTTTAG GTCTCATTTA GTCCATCACC TTTATTTTAA GTTGAGGAAG 3720 

TGGAGGCTGG TAAAGAGCAG GACCAGAGGA AGAATCCAGA TTTCCTTATG CTTGGGCCTC 3780 

ACACTAGCTC TNTGAGTATT TCCTTGATTG CGGTATATGT ACTACTAGAA AATACCAAAT 3840 

GGATATATTT TCTTTAGGAT AACCTTTGAA CCT^CAATNT TCAATAACAA TAGTACATCT 3900



271 



wo 02/086443 

TCCATCTTAC TTTTAATCGA GTATAAGGAA 
AGGGGATGAG GCTTGGCATA GTCCAAAATT 
TTGTTTTAAA TTGGCCCACT TTCAAGGCAA 
CCACCCCTGT CATTCACTTC CAATTTTACC 
5 TGTTAATTTT TGCACGGTCT ACACACATCA 

ACTTCTCCCT CTTTTTTACA CACACACACA 
TTCCTACCTC CCTGATTTTT CTTCCCTACA 
ATGTATATAT TGGGGCTGGG CTGAACAACT 
ATTGAGAGAA AAGCTCCTTT TCTCTTCACT 

10 GCTTTGTGTC CTTATGGACT TTAGTATTAG 
GAAGGAACCT TAAGATCACA TCATCTACTC 
AGGAAACCGA GACACAGAGG TAAAGTAATT 
GATTGGGTTT ACAACCCACA TCTCCTGGCT 
TATTGCCTTC CATTAGGCTC CTGAGAGTTA 

15 ACATGCTGCT GCCCTGATCT CAGTGGGAAA 

CCCTGCATTC ACCTGGTTCC CATCCACATG 
ATTGAGGGCC AATAGGAGCA ATGGGGTCCC 
TGCTCCATCG TGAGGAGCCC TCTGAATAGC 
GAATGGAGGA AGATTGATTT TCTCCATCAG 

20 CTTTCCAGGC TGAGGGAAAT GTTTCTTGTT 
ATANCTTTGT TCATCCTAAC TTTCTGAGAT 
6GTACCATTC ACTGGCANGA TTTNTTTTAG 
AACCTGGTCT CACTGTGGTT GCCCTCATCC 
GAGGACAATT TCCAGGTATA AGCAAGGGGC 

25 TAAACATTGG CTCCTGTGTT TGCACCAAAA 

CATCGTCTTG TGTACACTGC TCCTGTGGCC 
CAAACACATG GTTTTCCTTG CTGCAAGGCT 
TTCAGTTNTA AGAGACCTCC TTCTGGGCTT 
CCTCCTTCTC CTCCACAGTC ACAAGTAACC 

30 TGAAGAAGGC AAGGAACCCT GAGATTCTTC 

GTGATTGGTG CTTACCTTGA ACAAAATTTT 
GGTACAATGC TCCCAATCAC CCTGCACATT 
TCCATATCCC TAGGACAAGA NAACAGGATG 
TGATGGGAAT GATCCCAAN6 ATCACCCCAC 

35 CCAGATAGAA NCACTGGGAC AGTGGTTTGA 

ATGGAAATAA AAGGCATTGA TTTTTTAAAA 
AGGGCCACTT GGATCCATTT CCAGGCCTTA 
GCTTTAAGTC CCAGACTGGT CTCCCAAGTG 
TGAGGCATGA GAATGTTGCC CCATCTATCC 

40 TCCTAAAGCC TGGTCCCCAA AAATTGTTTT 

CACCCANACT CTTAGTGTTG CGTCCTGCCT 
CCCGCTTTGG CTTAGCTAGC GTGACATTGG 
TTTTTTTTTG ACTGAGTCTC CCTCTGTCAC 
CTCGCTGCAA CCTTCACCCT TCACCTCCCA 

45 C6AGTAGCTG GGATTACAGG CGTGCGCCAC 
TTTTAGTAGA GATGGGGTTT CACCATGTTG 
ATTATCTGCC CACCTCGGCC TCCCAAAGTG 
GCTGACAAGA CTAATTTTTT ATCCCTTGGT 
GGTGATTTTT TCTTACCTTG GATGCCTGAG 

50 TTAAGGCATC TTTCTGCTCC TGATCAGAAG 
CAACAGT^GT CACCTTGTAA GTAAGGCAAA 
TTAGGTCAAT AACCTTGAGG GAATCAATGG 
TCTTTGACTT TTCTTTCTCT GTCTAGTTTC 
AGTCTCTCTT TCCACAGTAC AAACATCCAT 

55 CTTATTATCT TTATTTGTAC TTTTTCCTTC 

TTCTTAGCCT GTGATTTTGC CTTGGGACTG 
CTGGTCCTAC CCCAGTCCAA TCAGAAGTAT 
TTTCTTCTTC TCCATTTTCA TTCGTAATCC 
TATAGCTCAT GTATCTTTAG GTCTTTGCCT 

60 CCTTTTTAGT CTGACATTTT GTGGAGCAGT 

GAGAAAAAAT CCACCCATGG ATTTATATCA 
GTGCCCATJWil TTTTTAAAGC TGCAATATAA 
GTCATTTGTT TTGGCTGGAT GGGGGTGGGG 
ATTAAACTCT CTATAATAAT CTTGTTTGGG 

65 ACTGGTAGGC AATCGGAGTT GATTTAAATG 

AAAGAGACAT TTGCTTAATT GACATGTATT 
CTTGACACCA ACTGTTCATG ATACTGAATA 
TAAAGAAGCC AGATTGTAGG TGTTAATTTA 
TCACTGCTTG GCAATACCAT ATGGCATGCC 

70 ATCCAAAAGG GATTTGAACA AGTAAGAGGT 

GTAATATATT GCAGCTTGAA GCCAATGATC 
TTTATTGAAT TTTGCATTTC CCACGTGTGG 
NTGTGCCATT AAACTTGTAC AGAAAATGTT 
AAAATGGT^ CAGCCCACCC TTTCTGCCCT 

75 CAAAACAGCT GTAATTGGTG GTTGTAGTGT 
TGGAGAGTAA ATGCATGGTA TTGTACATCA 
GAATATATTC TTCTTTGTAG TCCTTCTTCC 
CCAGTTGTCT TACAGTTGTA AATATCTGAT 
TCAGCAAACA ACAAACAAAC CAAAATGTGG 

80 AGTTATTGAT CATTTCTTAA GGAACAGCAT 
TCAGTGGTAA ATTGGGGTTG TATTGGCCAT 
AATCACATGT AATCCAAAGA CAGTAGGTAG 
GATAGAGACC TCAGAAGACT CTGCTTGACC 
AAAAATGAGA GATVATAAAAC AGATATTTAA 

85 AGCCAGAAAA AAAAACAAGG GCATGAGTTC 

ACCTAACCTA CTCTGAAATT GTGATTCAAA 
TGGTTTGCTG ACCCCACTTG GACTGGTAGG 



ATGTTTCTTT ATGGCCATTT TGGAGGGAGC 3960 

TAAGNCTCCA ATAATTAATT GCATTTTAAA 4020 

TTTTTTTTGT GTGTCTGTAA CTGAGCTCCT 4080 

CAATCCAATT TTAGCACTCA AGTTCCATTG 4140 

AGTCAGCAAG CATTTGCCAC CACTCCCTAT 4200 

CACACACACA CACAATCCAT CTCTTGCTTG 4260 

GAAATAGAAA TAGGGACAAA GAAGGGGAAA 4320 

AACTTCATAA GTAGTATTAA CTAGGGGTAA 4380 

GTTTTGGAAA GGATAGCCAT TAGCATGACT 4440 

CCTAGATTGA ATTATAGCGT TTTTCTAGCT 4500 

CTCTACTCCA AATTTCTCAT TCTTCAGGCC 4560 

TCCCCAAGGT CACACAGCTG GCTGGGGCAG 4620 

CTTATTCCAG GGCCTTTTCC CACTAAGTAG 4680 

TTTCTCAGGG TCATGTTGCA TCTTGGAGCC 4740 

TNCACCCAGC AACCTAATAC AGCCCCTTTT 4800 

GGTTGCAGAT GTCCTTGAAG AGAGTGAGGC 4860 

TGGCCTTGTC CATCTGATTC AGGAGATCAC 4920 

CCCCCACTGA ATGCTTGCCT TGCCCAAATG 4980 

TTCACCTTGT GTCATCTCAT AATGGTTGGT 5040 

TCCANAGTAN AAAAAAGAAA GAGTGGAACA 5100 

GGCTTTTCAA CATTTAAAAA AAACTAGTGT 5160 

AATATGGGAG TAAGATGAGG TAGAGAAAAT 5220 

ACAATGTCCC CAAAGCCATC CTGCTNTGAT 5280 

TTTGTGACAA AAATGTACCC TGGCTGATGT 5340 

TAGCAAGCTG TGTGCTCTAT ACACTCTTCC 5400 

TTCCACAGCA GAAACCAGGG CAAAAGGGTC 5460 

NTTCCTGGGA ACTAAGGGGG TATTTATTAG 5520 

ACCCCACTCC TCAGGTACTT CTCTCTCCTT 5580 

AAGGAACCTG AAAGTGGATG TGTAGCTATT 5640 

TTTGAATCCT TTAGTCCAAG TCTTAGACCA 5700 

GTCTGTGTTC CTAATCCCTT CAATACTNTG 5760 

TGATTCTAAA TGGCTTTTAT TTTTTAAAAA 5820 

CCTATATCCC CAAAATGAGC TCCAGGACAC 5880 

CTCAGAAAAC GTCTGTGCCA ANAGACTTCC 5940 

ACGACTTCTT TTATGGTTGT CCAGTTTGCT 6000 

AAGATGATTG GAACCTGTCT TTGGCCACAT 6060 

CTCATATATT GCCTTCACTG AAGGGCTTTG 6120 

AACCATAAGT GTTTTGGAGC TCATCTGGGG 6180 

CTTCAGGAAA AGGTGCCTTC CCTCCCTTTC 6240 

TGTCTCCAAA AGTCTAGTAT GGTCTTTATA 6300 

TGTTTCCTTG TTAAGGATCT ATGCANACCT 6360 

CTATCATTTG ACAAGACTAA CTTTTTTTTT 6420 

CTAGGCTGGA GTGCAGTGGC ACAATCTTGG 6480 

GGTCGAAGCG ATTCTCCTGC CTCAGTCTCC 6540 

CAAATCTGGC TATTTTTTTA TTATTATTAT 6600 

GCCAGACTGG TCTTGAACTC TTGGCCTCAA 6660 

CTGGGATTAC AGGCATGAGC ACCATGCCCA 6720 

TTATTGGCTT CAACATCTTC TGGAATCAGA 6780 

ACTAGGGGAG TATAGAATTC CAATTGGTAA 6840 

GGCAGGTTAG TTGGGAGAGG TCAGATGGCA 6900 

GACTTTG2WG GCATTAGCGT TTCTCATTAC 6960 

CTTTTTTGCC GCTCTACCTC TTTGTGTATC 7020 

CTCTGTTCTC AGTTTATATT CTATGTTATC 7080 

CCTTTCTCCT GTGCAATTCT GTCTCTCCCT 7140 

CTCCCTGTCT AGGCATTGGG CATGTGCCTC 7200 

ATGATAAATT ATTTCCAGAT TCAATCAGCC 7260 

GTTGGTGGGG AATCAACCTG ATCCTGGCCC 7320 

CCCTCAGCAG ATCTTTACAA GCAGTTTCCT 7380 

TCCAAGCACT GTACAGAATA CTTTGTGGTT 7440 

GAAGCGTGCT CAGAGACATA ATCAGCTGAA 7500 

GCTAAATACT AATAATTGAT TTTGTTTGAT 7560 

TATAATGAGG GACCACAGGT AATTTCTCCT 7620 

GAGTAATTGC TTAAAGTTTT ACCATTACAC 7680 

GCTTGCTAAC TGTTGAGCTG TTTTAACTAA 7740 

AAAAGATAAT TTAACAAATC TATACTATAA 7800 

TTTTCCTTCT GAGTCACCTA AACATTTACT 7860 

GACAGTCCAT ATAAGAGAAA TTAGTGGACC 7920 

TTAAACAGAA TTGCAAAGCC CTTGGAAATG 7980 

AAAATTTACA ATGACTTTTC TTTATAAGTT 8040 

TATGCCAAAA TGTCTCCAAT GTATGGTCCT 8100 

CCTTATGACT TGTATACAAC TAATGCATGT 8160 

TAAGTCTTTA AAATGTTTTT GATCACCTTT 8220 

TTTATGGCCA TTTTCAAAGG GAGAAAGTTT 8280 

ATAGCTGTAG TTAGAATTGA GTACCTGTAG 8340 

TAGAGGTGTT AGCTTGCTAG TGACTAGCTT 8400 

CATTTCTTAA CTCGTTTTAA CCTCTGAAAA 8460 

CACCCCCTTG CCCTCTCCCT CTCCCTGCTC 8520 

TTGAGGCCCA ATAACTCTTG CCAAGTAAAG 8580 

GGAAAAGGCA TTTCTCAACC ATCTCTCAGC 8640 

TGTGATCAAA GACTCAACTT TACGTAAAAA 8700 

TGATTACATT CAGGATTGAA TAGTTTTCAG 8760 

TGATGTCCCT TATCCCTGCA GCTGTTTTAA 8820 

GATGACCAAT AATTATTTGA AAAAAAAAGA 8880 

GAACTTTAGC CACCTATTTA GAATAGTTAT 8940 

AAATGCATTA CTATCAGTGT CCTAGGCAAT 9000 

AGCAGTATTT CAAGAGGCAT TCTCCTTTTT 9060 

TTTGGTGAGG CCCCCATAAA CCAGCTGGAG 9120 
CAGACCCTTT TCATCTCCTG TGCCTGTAAC 
AATGAGGGCT TTCTTGGGTC AGAGGACTTC 
TAAGGTGCTG TGAACTGTGA GTGCTGAAGA 
AGCTGCTCTT GCAATTATTT TGGCTCTCAA 
TGTACATTTG CAAGATGTGT GTAATGTCAT 



Seq ID NO: 214 Protein sequence: 
Protein Accession #: NP_000546 



ACCCCTCTTC CCCCACCCCC TCCGCAATTC 
AAGGTTGTCT AGAGAAGTTT GCCATGTGTG 
TTCGCAGCAT TCAATACCAG GCAGCCAAAG 
GCTCTGTTCT TCATCGCATT CTCATTTCTG 
TTTCCAAAZUl TAAAATTTGA TTTCAAT 



1 11 21 31 41 51 

| | | | | | 

MELDFGHFDE RDKTSRNMRG SRMNGLPSPT HSAHCSFYRT RTLQALSNEK KAKICVRFYRN 
GDRYFKGIVY AVSSDRFRSF DALLADLTRS LSDNINLPQG VRYIYTIDGS RKIGSMDELE 
BGESYVCSSD NFFKKVEYTK NVNPNWSVNV KTSANMKAPQ SLASSNSAQA RENKDFVRPK 
liVTIIRSGVK PRKAVRVLLN KKTAHSFEQV LTDITEAIKL ETGWKKLYT LDGKQVTCIiH 
DFFGDDDVFI ACGPEKFRYA QDDFSIjDBNE CRVMKGNPSA TAGPKASPTP QKTSAKSPGP 
MRRSKSPADS AN6TSSSQLS TPKSKQSPXS TPTSPGSIiRK HKDXiYLPLSL DDSDSLGOSM 



Seq ID NO: 215 DNA sequence 
Nucleic Acid Accession #: NM_130467 
Coding sequence: 312..644 

1 11 21 31 41 51 

| | | | | | 

GGCACGAGGC AGAGCTCTGC AAGGAGAGGT TGTGTCTTCG TTCTTTCCGC CATCTTCGTT 
CTTTCCAACA TCTTCGTTCT TTCTCACTGA CCGAGACTCA GCCGGTAGGT CTGCAGAGTG 
GTCTTCCTGG TAATTTAGTT GTGAGTGAAT GTGTGGAGGA GCCAGCGGGC TTAGGACAGG 
TCCTGTGGCA CAGTCCGTGG CTTTGAGGGA AAAGGGCCTC GCGGTGGTCC TCCGCCTTCC 
CCCAGGTCGT GATGCAGGCG CCATGGGCCG GTAATCGTGG CTGGGCTGGA ACGAGGGAGG 
AAGTGAGAGA TATGAGTGAG CATGTAACAA GATCCCAATC CTCAGAAAGA GGAAATGACC 
AAGAGTCTTC CCAGCCAGTT GGACCTGTGA TTGTCCAGCA GCCCACTGAG GAAAAACGTC 
AAGAAGAGGA ACCACCAACT GATAATCAGG GTATTGCACC TAGTGGGGAG ATCAAAAATG 
AAGGAGCACC TGCTGTTCAA GGGACTGATG TGGAAGCTTT TCAACAGGAA CTGGCTCTGC 
TTAAGATAGA GGATGCACCT GGAGATGGTC CTGATGTCAG GGAGGGGACT CTGCCCACTT 
TTGATCCCAC TAAAGTGCTG GAAGCAGGTG AAGGGCAACT ATAGGTTTAA ACCAAGACAA 
ATGAAGACTG AAACCAAGAA TATTGTTCTT ATGCTGGAAA TTTGACTGCT AACATTCTCT 
TAATAAAGTT TTACAGTTTT CTGCAAAAAA AAAAAAAAAA AAA 



Seq ID NO: 216 Protein sequence: 
Protein Accession #: NP_569734 

1 11 21 31 41 51 

| | | | | | 

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession #: NM_001476.1 
Coding sequence: 82..435 

1 11 21 31 41 51 

| | | | | | 

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT tttcctctac 

tgagattcat ctgtgtgaaa tatgagttgg cgaggaagat cgacctatta ttggcctaga 
ccaaggcgct atgtacagcc tcctgaagtg attgggccta tgcggcccga gcagttcagt 
gatgaagtgg aaccagcaac acctgaagaa ggggaaccag caactcaacg tcaggatcct 
gcagctgctc aggagggaga ggatgaggga gcatctgcag gtcaagggcc gaagcctgaa 

GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 

GGGCAGGAGG tggacccgcc aaatccagag gaggtgaaaa cgcctgaaga aggtgaaaag 
caatcacagt gttaaaagaa gacacgttga aatgatgcag gctgctccta tgttggaaat 

TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA 

Seq ID NO: 218 Protein sequence: 
Protein Accession #: NP_001467.1 

1 11 21 31 41 51 

| | | | | | 

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QFSDEVEPAT PEEGEPATQR QDPAAAQEGE 
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC 

Seq ID NO: 219 DNA sequence 
Nucleic Acid Accession #: NM_001476 
Coding sequence: 90-3671 

1 11 21 31 41 51 

| | | | | | 

ACAGCGGAGC GCAGAGTGAG AACCAGCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 
AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 
GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 
ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 
TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 
GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 
CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 
CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 
ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 
ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 
CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 
GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 
TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 
AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 
TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 
TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 
GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 
TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 
TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 
CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 
TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 
AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 
GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 
GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 
ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 
CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 
CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 
TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 
GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 
ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 
GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 
GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 
CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 
AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 
GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 
AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 
ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 
AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 
CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 
CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 
AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 
AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 
AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 
AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 
AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 
GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 
TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 
CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 
AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 
AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 
AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 
AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 
AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 
GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 
TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 
TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 
ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 
ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 
ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 
ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 
ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 
ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 
GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 
ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 
ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 
CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 
GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 
AGCATTTTTA AAAAATTVAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 
GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 
CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AAGTGATTGC AACAGACTGT TGAGTTATGA 5040 
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

wo 02/086443 

Seq ID NO: 220 Protein sequence: 
Protein Accession #: NP_005553 

1 11 21 31 41 51 

| | | | | | 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60 
DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 
SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300 
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 
DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420 
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEVV 480 
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGVVPDTELE GRMQQAEQAL 660 
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 
RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AVVQGLVEKL EKTKSLAQQL TREATQAEIE 840 
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 



PCT/US02/12476 






Seq ID NO: 221 DNA sequence 
Nucleic Acid Accession #: NM_016529 
Coding sequence: 13-1854 

1 11 21 31 41 51 

| | | | | | 

GTCAAGAAAA GAATGTCTGT AATTGTTCGA ACTCCTTCAG GACGACTTCG GCTTTACTGT 60 
AAAGGGGCTG ATAATGTGAT TTTTGAGAGA CTTTCAAAAG ACTCAAAATA TATGGAGGAA 120 
ACATTATGCC ATCTGGAATA CTTTGCCACG GAAGGCTTGC GGACTCTCTG TGTGGCTTAT 180 
GCTGATCTCT CTGAGAATGA GTATGAGGAG TGGCTGAAAG TCTATCAGGA AGCCAGCACC 240 
ATATTGAAGG ACAGAGCTCA ACGGTTGGAA GAGTGTTACG AGATCATTGA GAAGAATTTG 300 
CTGCTACTTG GAGCCACAGC CATAGAAGAT CGCCTTCAAG CAGGAGTTCC AGAAACCATC 360 
GCAACACTGT TGAAGGCAGA AATTAAAATA TGGGTGTTGA CAGGAGACAA ACAAGAAACT 420 
GCGATTAATA TAGGGTATTC CTGCCGATTG GTATCGCAGA ATATGGCCCT TATCCTATTG 480 
AAGGAGGACT CTTTGGATGC CACAAGGGCA GCCATTACTC AGCACTGCAC TGACCTTGGG 540 
AATTTGCTGG GCAAGGAAAA TGACGTGGCC CTCATCATCG ATGGCCACAC CCTGAAGTAC 600 
GCGCTCTCCT TCGAAGTCCG GAGGAGTTTC CTGGATTTGG CACTCTCGTG CAAAGCGGTC 660 
ATATGCTGCA GAGTGTCTCC TCTGCAGAAG TCTGAGATAG TGGATGTGGT GAAGAAGCGG 720 
GTGAAGGCCA TCACCCTCGC CATCGGAGAC GGCGCCAACG ATGTCGGGAT GATCCAGACA 780 
GCCCACGTGG GTGTGGGAAT CAGTGGGAAT GAAGGCATGC AGGCCACCAA CAACTCGGAT 840 
TACGCCATCG CACAGTTTTC CTACTTAGAG AAGCTTCTGT TGGTTCATGG AGCCTGGAGC 900 
TACAACCGGG TGACCAAGTG CATCTTGTAC TGCTTCTATA AGAACGTGGT CCTGTATATT 960 
ATTGAGCTTT GGTTCGCCTT TGTTAATGGA TTTTCTGGGC AGATTTTATT TGAACGTTGG 1020 
TGCATCGGCC TGTACAATGT GATTTTCACC GCTTTGCCGC CCTTCACTCT GGGAATCTTT 1080 
GAGAGGTCTT GCACTCAGGA GAGCATGCTC AGGTTTCCCC AGCTCTACAA AATCACCCAG 1140 
AATGGCGAAG GCTTCAACAC AAAGGTTTTC TGGGGTCACT GCATCAACGC CTTGGTCCAC 1200 
TCCCTCATCC TCTTCTGGTT TCCCATGAAA GCTCTGGAGC ATGATACTGT GTTTGACAGT 1260 
GGTCATGCTA CCGACTATTT ATTTGTTGGA AATATTGTTT ACACATATGT TGTTGTTACT 1320 
GTTTGTCTGA AAGCTGGTTT GGAGACCACA GCTTGGACTA AATTCAGTCA TCTGGCTGTC 1380 
TGGGGAAGCA TGCTGACCTG GCTGGTGTTT TTTGGCATCT ACTCGACCAT CTGGCCCACC 1440 
ATTCCCATTG CTCCAGATAT GAGAGGACAG GCAACTATGG TCCTGAGCTC CGCACACTTC 1500 
TGGTTGGGAT TATTTCTGGT TCCTACTGCC TGTTTGATTG AAGATGTGGC ATGGAGAGCA 1560 
GCCAAGCACA CCTGCAAAAA GACATTGCTG GAGGAGGTGC AGGAGCTGGA AACCAAGTCT 1620 
CGAGTCCTGG GAAAAGCGGT GCTGCGGGAT AGCAATGGAA AGAGGCTGAA CGAGCGCGAC 1680 
CGCCTGATCA AGAGGCTGGG CCGGAAGACG CCCCCGACGC TGTTCCGGGG CAGCTCCCTG 1740 
CAGCAGGGCG TCCCGCATGG GTATGCTTTT TCTCAAGAAG AACACGGAGC TGTTAGTCAG 1800 
GAAGAAGTCA TCCGTGCTTA TGACACCACC AAAAAGAAAT CCAGGAAGAA ATAAGACATG 1860 
AATTTTCCTG ACTGATCTTA GGAAAGAGAT TCAGTTTGTT GCACCCAGTG TTAACACATC 1920 
TTTGTCAGAG AAGACTGGCG TCCAAGGCCA AAACACCAGG AAACACATTT CTGTGGCCTT 1980 
AGTTAAGCAG TTTGTTAGTT ACATATTCCC TCGCAAACCT GGAGTGCAGA CCACAGGGGA 2040 
AGCTATCTTT GCCCTCCCAA CTCGTCTGCA GTGCTTAGCC TAACTTTTGT TTATGTCGTT 2100 
ATGAAGCATT CAACTGTGCT CTGTGAGGTC TCAAATTAAA AACATTATGT TTCACCAATA 2160 
AGAAAAAAAA AAAAAAA 



Seq ID NO: 222 Protein sequence: 
Protein Accession #: NP_057613 

1 11 21 31 41 51 

| | | | | | 

MSVIVRTPSG RLRLYCKGAD NVIFERLSKD SKYMEETLCH LEYFATEGLR TLCVAYADLS 60 
ENEYEEWLKV YQEASTILKD RAQRLEECYE IIEKNLLLLG ATAIEDRLQA GVPETIATLL 120 
KAEIKIWVLT GDKQETAINI GYSCRLVSQN MALILLKEDS LDATRAAITQ HCTDLGNLLG 180 
KENDVALIID GHTLKYALSF EVRRSFLDLA LSCKAVICCR VSPLQKSEIV DVVKKRVKAI 240 
TLAIGDGAND VGMIQTAHVG VGISGNEGMQ ATNNSDYAIA QFSYLEKLLL VHGAWSYNRV 300 
TKCILYCFYK NVVLYIIELW FAFVNGFSGQ ILFERWCIGL YNVIFTALPP FTLGIFERSC 360 
TQESMLRFPQ LYKITQNGEG FNTKVFWGHC INALVHSLIL FWFPMKALEH DTVFDSGHAT 420 
DYLFVGNIVY TYVVVTVCLK AGLETTAWTK FSHLAVWGSM LTWLVFFGIY STIWPTIPIA 480 
PDMRGQATMV LSSAHFWLGL FLVPTACLIE DVAWRAAKHT CKKTLLEEVQ ELETKSRVLG 540 
KAVLRDSNGK RLNERDRLIK RLGRKTPPTL FRGSSLQQGV PHGYAFSQEE HGAVSQEEVI 600 
RAYDTTKKKS RKK 






Seq ID NO: 223 DNA sequence 
Nucleic Acid Accession #: BC017001 
Coding sequence: 1-394 



1 11 21 31 41 51 

| | | | | | 

AACGCTGGGC AGGGCCGGCG CGGGTCGGGG GGCGCCCGAG GGGCCCGGGC CGAGCGGCGG 60 

CGCGCAGGGC GGCAGCATCC ACTCGGGCCG CATCGCCGCG GTGCACAACG TGCCGCTGAG 120 

CGTGCTCATC CGGCCGCTGC CGTCCGTGTT GGACCCCGCC AAGGTGCAGA GCCTCGTGGA 180 

CACGATCCGG GAGGACCCAG ACAGCGTGCC CCCCATCGAT GTCCTCTGGA TCAAAGGGGC 240 

CCAGGGAGGT GACTACTTCT ACTCCTTTGG GGGCTGCCAC CGCTACGCGG CCTACCAGCA 300 

ACTGCAGCGA GAGACCATCC CCGCCAAGCT TGTCCAGTCC ACTCTCTCAG ACCTAAGGGT 360 

GTACCTGGGA GCATCCACAC CAGACTTGCA GTAGCAGCCT CCTTGGCACC TGCTGCCACC 420 

TTCAAGAGCC CAGAAGACAC ACCTGGCCTC CAGCAGGCTG GGCCATGCAG AAGGGATAGC 480 

AGGGGTGCAT TCTCTTTGCA CCTGGCGAGA GGGTCTGACT CTGGGCACCC CTCTCACCGG 540 

CTACAAGGCC TTGGACTCAC TGTACAGTGT GGGAGCCCCA GTTCCCACCT CTGTGACAAT 600 

AGGATCATGG CCTTACCCTT GAAGCATTAC CGAGAAGGAG AACAGAGATG GGCTTGAAGA 660 

GCCACGTGCT GCCGGCTCCA AATTCCCAAG GACAAGGATC CCTCTGCATT TTTGTCTATG 720 

TAACCTCTTA TATGGACTAC ATTCAGCTGC AAGGAAAGGA AAACCTTGAT TGCAGTGGTT 780 

TAAACA/W^CA GAAGATTGTT TTTCCACATA GCATGGATTC TGGAGATGGG TGGCTAATGG 840 

TATTGGTTCA ACAACTCCAC GGAGGTAGGG GTCACGTCTT GGATCCTTTT GCCTTAATCT 900 

CAGTGCTCGT TACTTCATGG TCCCAAGATG GCTGCTGTAT CCCCAAGAAT CATGTCTGCG 960 

TTCAAGGAAG GAGGGGTGGA GGAAGAGGAA GGGCCAAACT AGCTGGACCC GTCACCTTCT 1020 

ATCAGAAAGT AAAACCTCGT CAGAAGTCTG TTTCCTGCTC TCTCCCTCTG CATATCTTCA 1080 

CTTAGATGCC CTTGGCCCGA GCCAGCTACC ATTGCACCTC TAGCTGCAAA CAAAGCTAAG 1140 

ACAGCAGGGA ACAGAATTGT CATGGCTGAA TAGACCAATC GTGTTCCATC TACTGAGACT 1200 

GGCACACTGC CTCCTGCAAT AAAACTGGGA TCCCATTACC AAGAGAGT^ TGCAGAATTG 1260 

TGTACCAGTT AGCTTTTGCT GTGTAACAAA CCATCCCCAA ACTTGGCAGC TAGAAACAAA 1320 

CCCTGTATTT TCCCACAATC CTATGGGTTG GCAATTTGGG CTGGGCTCAA CAGGGCAGTT 1380 

CTGCTGCTCA CACCTGGGAT CCCTCATGGA GCTAAGGTCA GCTGTTACCT CAGCTGGGCC 1440 

TGGATGGTCT AGGATAGCCT TACTCACTTG CCTGGCAGGT GACAGGCTGT TGGCTGGAAT 1500 

TGCTTGGTTC TCCTCCATGT GGCCTCTCCA GCAGGCTAGC TCAGGCTTAT TCACATGATG 1560 

GCTTCAGGAT TCCAAAGAGA GTGAGAGTAG AAGCTGAAAG ACTTCTTGAG TTCTTGGCCT 1620 

GGAACTGGGA CTAGGACAGT GTCACTTCTG CTAAGTTCTT TTGGTCAGAG CAAATCACAA 1680 

GGCTTTACCC AGATTCAAGG GATGAGAAAC AGACTACATG TCTTGATGAG GGGAACCACA 1740 

AAGAGCTTGT GGCCATTTTT CACCTATCAC AAATAATTTT GGATGGGTAT TTATTTGGAT 1800 

AAAGGTATTT CCCTCTTCCC CCTTTCTCTC TGTCTCATGG GGCCTCACTC TGCCAAGTTG 1860 

GAAGGCACTA AGACATTGTC CTGGCCCTCA GGGTCTAGGG GAAGAGGTGT TGGGGCAGGA 1920 

AGTGAGTCTC TCCATGGGCT GGACCCACTG TAGTAGGAGT GCCTCCTTGT CTGCACTGCT 1980 

GGTATGGGGT TAGGCCAGGT AGGACATTCC AGAGGGGCTT CTGAAAACCA AGAGTCCCTG 2040 

GGGAAAGGGA ACAGAGTAAG GCAGGCCTTG TTCTCACTGC CCTCTAAGGG AACTTGGTCA 2100 

CTCGGCACTT TTAAGCCTCA GTTTCTCCAG TTCAATAATA AGGACAAGAG CTTTTCCCAT 2160 

GCATTCTCTT TCCCCGGGAA AGTTGACTGA GGTGACCAGT AATAGAATTG AAAAGGGAGA 2220 

GTGTCTTCAG TGCAATGTGG CATCCTGGAT TGGGTCTTGG AACAAAAACA GGACATTAGT 2280 

GGGAAAATTG GAAATCTGAA AAAAGTCTGA ATTTTAGTTA ATATACCAAT TTCAGTCTCT 2340 

TGGTTTTGAC AGATGTACCA TGGTGATGTA AGATGTTGAC CTTGGGGTAG GCTGGGTGAA 2400 

GGGTATACAG GAACTCTTTG TACTATCTCT GCAACTTCTC TGTAAATCTA GTATCATTCC 2460 
AAAATAAAAG TTTATTTAAT TTAAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 224 Protein sequence: 
Protein Accession #: AAH17001.1 

1 11 21 31 41 51 

| | | | | | 

TLGRAGAGRG APEGPGPSGG AQGGSIHSGR IAAVHNVPLS VLIRPLPSVL DPAKVQSLVD 60 
TIREDPDSVP PIDVLWIKGA QGGDYFYSFG GCHRYAAYQQ LQRETIPAKL VQSTLSDLRV 120 
YLGASTPDLQ 

Seq ID NO: 225 DNA sequence 
Nucleic Acid Accession #: NM_021048 
Coding sequence: 1..1110 

1 11 21 31 41 51 

| | | | | | 

ATGCCTCGAG CTCCAAAGCG TCAGCGCTGC ATGCCTGAAG AAGATCTTCA ATCCCAAAGT 60 

GAGACACAGG GCCTCGAGGG TGCACAGGCT CCCCTGGCTG TGGAGGAGGA TGCTTCATCA 120 

TCCACTTCCA CCAGCTCCTC TTTTCCATCC TCTTTTCCCT CCTCCTCCTC TTCCTCCTCC 180 

TCCTCCTGCT ATCCTCTAAT ACCAAGCACC CCAGAGGAGG TTTCTGCTGA TGATGAGACA 240 

CCAAATCCTC CCCAGAGTGC TCAGATAGCC TGCTCCTCCC CCTCGGTCGT TGCTTCCCTT 300 

CCATTAGATC AATCTGATGA GGGCTCCAGC AGCCAAAAGG AGGAGAGTCC AAGCACCCTA 360 

CAGGTCCTGC CAGACAGTGA GTCTTTACCC AGAAGTGAGA TAGATGAAAA GGTGACTGAT 420 

TTGGTGCAGT TTCTGCTCTT CAAGTATCAA ATGAAGGAGC CGATCACAAA GGCAGAAATA 480 

CTGGAGAGTG TCATAAAAAA TTATGAAGAC CACTTCCCTT TGTTGTTTAG TGAAGCCTCC 540 

GAGTGCATGC TGCTGGTCTT TGGCATTGAT GTAAAGGAAG TGGATCCCAC TGGCCACTCC 600 

TTTGTCCTTG TCACCTCCCT GGGCCTCACC TATGATGGGA TGCTGAGTGA TGTCCAGAGC 660 

ATGCCCAAGA CTGGCATTCT CATACTTATC CTAAGCATAA TCTTCATAGA GGGCTACTGC 720 

ACCCCTGAGG AGGTCATCTG GGAAGCACTG AATATGATGG GGCTGTATGA TGGGATGGAG 780 

CACCTCATTT ATGGGGAGCC CAGGAAGCTG CTCACCCAAG ATTGGGTGCA GGAAAACTAC 840 

CTGGAGTACC GGCAGGTGCC TGGCAGTGAT CCTGCACGGT ATGAGTTTCT GTGGGGTCCA 900 

AGGGCTCATG CTGAAATTAG GAAGATGAGT CTCCTGAAAT TTTTGGCCAA GGTAAATGGG 960 

AGTGATCCAA GATCCTTCCC ACTGTGGTAT GAGGAGGCTT TGAAAGATGA GGAAGAGAGA 1020 

GCCCAGGACA GAATTGCCAC CACAGATGAT ACTACTGCCA TGGCCAGTGC AAGTTCTAGC 1080 
GCTACAGGTA GCTTCTCCTA CCCTGAATAA 

Seq ID NO: 226 Protein sequence: 
Protein Accession #: NP_066386 

1 11 21 31 41 51 

| | | | | | 

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSFPS SFPSSSSSSS 60 
SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSVVASL PLDQSDEGSS SQKEESPSTL 120 
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 180 
ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 240 
TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 300 
RAHAEIRKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 360 
ATGSFSYPE 

Seq ID NO: 227 DNA sequence 
Nucleic Acid Accession #: NM_005025.1 
Coding sequence: 82-1314 

1 11 21 31 41 51 

| | | | | | 

GCGGAGCACA GTCCGCCGAG CACAAGCTCC AGCATCCCGT CAGGGGTTGC AGGTGTGTGG 60 
GAGGCTTGAA ACTGTTACAA TATGGCTTTC CTTGGACTCT TCTCTTTGCT GGTTCTGCAA 120 
AGTATGGCTA CAGGGGCCAC TTTCCCTGAG GAAGCCATTG CTGACTTGTC AGTGAATATG 180 
TATAATCGTC TTAGAGCCAC TGGTGAAGAT GAAAATATTC TCTTCTCTCC ATTGAGTATT 240 
GCTCTTGCAA TGGGAATGAT GGAACTTGGG GCCCAAGGAT CTACCCAGAA AGAAATCCGC 300 
CACTCAATGG GATATGACAG CCTAAAAAAT GGTGAAGAAT TTTCTTTCTT GAAGGAGTTT 360 
TCAAACATGG TAACTGCTAA AGAGAGCCAA TATGTGATGA AAATTGCCAA TTCCTTGTTT 420 
GTGCAAAATG GATTTCATGT CAATGAGGAG TTTTTGCAAA TGATGAAAAA ATATTTTAAT 480 
GCAGCAGTAA ATCATGTGGA CTTCAGTCAA AATGTAGCCG TGGCCAACTA CATCAATAAG 540 
TGGGTGGAGA ATAACACAAA CAATCTGGTG AAAGATTTGG TATCCCCAAG GGATTTTGAT 600 
GCTGCCACTT ATCTGGCCCT CATTAATGCT GTCTATTTCA AGGGGAACTG GAAGTCGCAG 660 
TTTAGGCCTG AAAATACTAG AACCTTTTCT TTCACTAAAG ATGATGAAAG TGAAGTCCAA 720 
ATTCCAATGA TGTATCAGCA AGGAGAATTT TATTATGGGG AATTTAGTGA TGGCTCCAAT 780 
GAAGCTGGTG GTATCTACCA AGTCCTAGAA ATACCATATG AAGGAGATGA AATAAGCATG 840 
ATGCTGGTGC TGTCCAGACA GGAAGTTCCT CTTGCTACTC TGGAGCCATT AGTCAAAGCA 900 
CAGCTGGTTG AAGAATGGGC AAACTCTGTG AAGAAGCAAA AAGTAGAAGT ATACCTGCCC 960 
AGGTTCACAG TGGAACAGGA AATTGATTTA AAAGATGTTT TGAAGGCTCT TGGAATAACT 1020 
GAAATTTTCA TCAAAGATGC AAATTTGACA GGCCTCTCTG ATAATAAGGA GATTTTTCTT 1080 
TCCAAAGCAA TTCACAAGTC CTTCCTAGAG GTTAATGAAG AAGGCTCAGA AGCTGCTGCT 1140 
GTCTCAGGAA TGATTGCAAT TAGTAGGATG GCTGTGCTGT ATCCTCAAGT TATTGTCGAC 1200 
CATCCATTTT TCTTTCTTAT CAGAAACAGG AGAACTGGTA CAATTCTATT CATGGGACGA 1260 
GTCATGCATC CTGAAACAAT GAACACAAGT GGACATGATT TCGAAGAACT TTAAGTTACT 1320 
TTATTTGAAT AACAAGGAAA ACAGTAACTA AGCACATTAT GTTTGCAACT GGTATATATT 1380 
TAGGATTTGT GTTTTACAGT ATATCTTAAG ATAATATTTA TWVTAGTTCC AGATAAAAAC 1440 
AATATATGTA AATTATAAGT AACTTGTCAA GGAATGTTAT CAGTATTAAG CTAATGGTCC 1500 
TGTTATGTCA TTGTGTTTGT GTGCTGTTGT TTAAAATAAA AGTACCTATT GAACATGTG 



Seq ID NO: 228 Protein sequence: 
Protein Accession #: NP_005016.1 

1 11 21 31 41 51 

| | | | | | 

MAFLGLFSLL VLQSMATGAT FPEEAIADLS VNMYNRLRAT GEDENILFSP LSIALAMGMM 60 
ELGAQGSTQK EIRHSMGYDS LKNGEEFSFL KEFSNMVTAK ESQYVMKIAN SLFVQNGFHV 120 
NEEFLQMMKK YFNAAVNHVD FSQNVAVANY INKWVENNTN NLVKDLVSPR DFDAATYLAL 180 
INAVYFKGNW KSQFRPENTR TFSFTKDDES EVQIPMMYQQ GEFYYGEFSD GSNEAGGIYQ 240 
VLEIPYEGDE ISMMLVLSRQ EVPLATLEPL VKAQLVEEWA NSVKKQKVEV YLPRFTVEQE 300 
IDLKDVLKAL GITEIFIKDA NLTGLSDNKE IFLSKAIHKS FLEVNEEGSE AAAVSGMIAI 360 
SRMAVLYPQV IVDHPFFFLI RNRRTGTILF MGRVMHPETM NTSGHDFEEL 



Seq ID NO: 229 DNA sequence 
Nucleic Acid Accession #: NM_003695 
Coding sequence: 12-398 

1 11 21 31 41 51 

| | | | | | 

CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 60 
CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 120 
TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 180 
ATCTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG CTACACCCTG CAAGGCCAGG 240 
TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 300 
ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 360 
TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 420 
TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 480 
GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 540 
CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 600 
ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 660 
GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 720 
TAAATGATTT AAACC 



Seq ID NO: 230 Protein sequence: 
Protein Accession #: NP_003686 

1 11 21 31 41 51 

| | | | | | 

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSVVCPA SSRFCKTTNT VEPLRGNLVK 60 
KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGLALSLL 120 
AVILAPSL 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 126-752 






1 11 21 31 41 51 

| | | | | | 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCACTCAG 120 
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 
ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 
AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 
TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAAATT 720 
TTCAGGATGG CTGTATTCTG CGGTCAGAAT GAGAGAGTCA AGCTGGGCAG AATCTCTCGC 780 
CAAGAGTTCA GCCTTCCTTT GGAGACTGCT CCATCAGTGC CGAGGTGTGT GGGAACAGGC 840 
TTCACTGCAC CGCCATCTTA CTGAGTTGCT TCACGTGAGG AAAAGGGGGC TTTGGCCCTG 900 
TGACTCAGTT CCACATTTTG GATTGCATAC TGGAAAAGAA GCCAATCTTC TTGCTAGTAA 960 
ACCAGCAACC CGGCTGTATA CAGTGGTGAC CCAAGCAATG GATATAAACC TAAAAATCTG 1020 
AGGGAGGGGA GAGGTGGAAT ACAGTAGTTC TTGGAATCTG AAGTCTCCTA TTTGATCAGG 1080 
TTATTTCCTG GGACTTGGCA AAAATCTGAT TGGTGGGGAT CTCCTAGGAC CTAGTGGACA 1140 
TCTGGTATTA ATTTAATCTC AGGAAAAACA AGAAATTAAC CCAGAGAGAG TCTGGGTTTT 1200 
GGAATTCAGC GTAGCTACCT CCAGACCGTG GTGTCTGGCC TCCATTTTTG TCTGTCATTC 1260 
AGCTCTGACT TACAGCTGCA GTCACCTTTG CTATAAGGCA CCTGGGTAGA AGGGTGGATG 1320 
GGCTTCACAT CAATTTTTTT CTTCCTTTAG GGTGGGGGAT TGGTTTGGCT TTCTTTTGTT 1380 
GTGGTTTTTT GTTTTATTTT TGTCAAGATT GATTTTTAGA TGCAAGGACT TGAAAAGACC 1440 
CAGAAGGATG CCACCAGTTT TTCCTTGAGG CCTAGGATTT TTTATTCTGT CCCGAGCAGA 1500 
GGTAATTCCT CACAACTTAG TGCACCAGTA GCACCAGCCA TTTTGAGCAG AGTACCTCTT 1560 
TGGGGAGCTT TTCGTTTTGT TTTGTTTTTA ATTCTCTTTC CTTAGCAGCA AGGTCTTTTT 1620 
TCCTAGAGAA TCTACTCCGT TGCAGAATCA TTGCAACCTC AGGAGCCCTC ACTGATTGAG 1680 
TGCTGTCAGC CTGATATACT ACTTTGGACT CTGGAAACAG ATATGGGTTC TATTCTCTAT 1740 
TTCTACTGTG TGTCGTTAAA CAACCGTCGG AGACCAGATG ACCTGTTAGA TGGCTAGTCC 1800 
TGTATAACTC GACTCTGTAT GTTTCAATGT ATGTTACTGC AATGCTTCAC CTGCTGTACA 1860 
GTGTTTGTGA GATGCTCTTT GAAGATGGTA CTTTTATATT T 



Seq ID NO: 232 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | | 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 
VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 
LSIVCLMITQ LAGFSGPNFQ DGCILRSE 

Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TTTTAATGGT 
CACAACATGA 
TGTTTCTTAG 
TAGTAAACTT 
TTTGTTTCCA 
TGTTTGACTG 
TTGTTATGGT 
AAGCTATTTT 
TATATTATGG 
ATTAGCCTAC 
AATAGGTTAA 
AATTAAAGTA 
TATGAAAGCT 
TATATATTTT 
TATCCTGTAT 
TAGAAGACCC 
TTTTTTTTAG 
GATTCTGTTA 
TCCAGATAAA 
CTTAGGCTCT 
CCATAGTATC 
ACTATAATAG 
TATCAATGTT 
TACATAAAAA 
AAGTCATTAA 
AGAATCATCT 
GGAAGATGGA 
GAGTTGAAGA 
CTTTTAAAAC 
TATTCTTCTA 
AGCCCTTGTG 
CTGTTCAGGT 
TGAGATGGAG 
ATCATATGGA 
TTTTAATTCT 
CAATCCCCAA 
TTTCTGGAAT 



11 

I 

GCTCATATAT 
ATCACATAAT 
ATCGTAGTCA 
CTCTTTCATC 
AAGTCACAAT 
AAAGCAAAAC 
ATATCTTTTT 
AATCATCAAG 
ATTAACCAGA 
TATAGTATTA 
AAAGTAGTTA 
CATTTTAAAT 
CTGGCTATCA 
TTAGTTCCTT 
TTTTTTTAAG 
ACTCTTACTA 
GTAGTTTAAA 
AGCATCCAAA 
AGATGGAGAA 
CTCCATGTAT 
AAGTGGAGGG 
AAGTTTGAGT 
ATCCAATGAT 
GTGCTCATGT 
TAATTTAATA 
GAGGACTTTT 
GCCATGCTTG 
CCACTGCTCT 
TAGTAATGTA 
ATT^CTAGCA 
TAAGATTATT 
GAGAAAACAT 
GAGGTGGGCA 
AGTTTGGAAA 
CACTGTCTCA 
GGGGCAGTGT 
TGCACCTACA 



21 

! 

ACTGTATTTT 
CATGATTTTT 
TTGAGAAGTC 
TTTGTGTTAG 
TGAATTATTC 
AACGTGACAG 
ATTTUU^TATT 
TATGGAAAAC 
ATTGTATCAT 
ATAAATTTTT 
CAAATTAAAC 
GAGCTTTATA 
TCCTGGGATA 
TGAGATAACT 
AATTGTTTTA 
GGTTCCCTAA 
GCAAGCACTG 
AACAATGCCT 
TACCTCATGT 
CTTTCTTAAG 
TAGTTCAGAA 
AATATTTTAA 
TTTTATTAAA 
ATTTGAATTT 
ATTGTTTTAA 
AATATGGAAT 
TTTTTCCAAA 
AAATTAGTGC 
CCCAGTTAAG 
TTTATTACAT 
ATTTCTTCTC 
AATGGATTTT 
CATTTAAGGT 
AGAGAGCTTA 
AAAGAGAATC 
TACCTTACTC 
TGTTTTCTTA 



31 

I 

TTGTTGTTTA 
TTTTTTTACT 
CCAATAACTC 
CTCTGTAGTC 
TTAGATACCT 
TTTATTTTCA 
TATTTTGACT 
AAATTACTAT 
TTTTGGCCTA 
CAGTTGGTTT 
TTACTAATTT 
ATACCTTAAA 
GTAATTTCTA 
AATTTCTAAT 
TAAATAGGTC 
GGATCTGCCA 
ATACCAGTGG 
AATTTCAGTT 
ACTGTGACTT 
GAAAAGTTTC 
AAGTTAATAG 
TAAATTTATA 
AAATTACCTT 
TAAATAATTT 
ATCAGTGGTT 
CCACCTCATA 
AGCTCTTTGA 
AGGAAAATGC 
TTTTGATGGT 
GAAATTTAAG 
TATAACTTCA 

CAGTTCACTA 
TCACAGGTTT 
AGCTCTCCAG 
CTTCACTGCT 
TTAACATTCA 



41 

I 

GTTTTACTTA 
TTTACTCCCC 
TAAACTTTTG 
TTAACCTGGA 
TAAGCCACTG 
AACACTAACT 
AAGCTTTCAT 
TGCATTTTCC 
ATGTCTGGAT 
GGGCAAATTT 
ATACCTGATT 
AAGTTGGTTC 
ATTATATAGT 
TATATATGTT 
ATAAGATACA 
TAGATTTTTT 
GAGTTGGTCT 
CTTAGGTTAT 
GAAAATGAAT 
TGAGTGTGAT 
GAAATCTTTT 
TAATTCAAAT 
ATTATTAGAA 
ATTTAAATCA 
TTCAACCCTC 
ACAATTAAGT 
GTGATTCTAA 
TTTTATTTCT 
TTAAATTCCA 
AGTTTAAGTT 
AAATAGATAT 
CTCTGGAGCT 
ACCTATGGTT 
GTATGCTGGT 
CAGTTCTAGA 
TCTTAGAAGG 
GAATTGGGAA 



51 

I 

TTGAGAGTGT 
AAATTATTCA 
AGTTATAACG 
TTTTAATTTT 
AATTCAGTTC 
TCTTGATATT 
AAAATATTTG 
TATATATGCA 
ATAAAAGATA 
AAACCTGAAA 
TTTTTTCTTG 
TAATTTAAAA 
ATTTCAAAAC 
TCAAAAACCA 
AGGTCTGCAT 

TGATCTAGGA 
GGCTTGTGAC 
TCTTAAAATT 
CTCTCTTTTG 
GTGACAGCAG 
GATAAAAATG 
CTGTGCCTAT 
AGACCACCAT 
ACTTCATATT 
CTAAATTTCT 
TTTGTAGTCA 
CCCATGTTAA 
CTAAAGAACA 
CCATCAAACT 
TTCATTCAAA 
GCCTGTTCAG 
CAGAGTTCTG 
GAATGGATAG 
AAAGCTTTGA 
TAGAATTAAG 
TATTAATTTT 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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TCCAGTGAGT AGTTTTCTGA AATTGGTAAC TTGGAGAGTA AAATAACGTA TTTTGCTTTT 2280 

CAATTTTGTG TTTGTTTACT TTTATGTAAA AATTTGATAT GTGAATTACA CAGTTCTAAT 2340 

AAAACCTCAT GCCTTTTCAT TACATCTAAT TTGAACTCTC AACTTCAGTG CCAGAAGTGC 2400 

TTTAAAGATG CTTTAATGAA AAGTATTAAG AAAATATATA GATTTGTATG TCAGTTTATA 2460 

CTTCAGAAAT CCATATATTT GTCATATTTA TTTTTTTAGA AACCTCCTAA TTGGATAACT 2520 

AGATGGTATT TAAAATGAAT GCCCAAAAAT ATCTTGTACC TTTGTCCAAA AGTTTATCTG 2580 

TTGGAAGCCG CCAGCCATTC ATGTAGAGAG TTTATAAGAA AATAATTTAA AATTGTATGC 2640 

ATTTTATATT ACTATGGTAT CTGTGTACCA TATTTCTAAG TATTCATTAT TAAATTGGTA 2700 

CTTCTTAAAA CCATAACCTG GCTTGCCTTT TAGTGTTAAA CACAAAATCC AACATTGTAT 2760 

ATAGAGATTC TTCTTTTATG AAGAAGAGCT GACGTAATTT ATTACCAGTG CATCTGCACA 2820 

AAGACATTAA CATAAGTCTC TGAGCAGTGA TACATTTTCA AACATGAAGA GTGACAACCA 2880 

CCACATTAAA CAACCACGGC AACACTCAGA CTTGGCACTT TCCTACGAAT CCATCCTATA 2940 

TGTGCCTGGT ATCGCCTCTG GCATAACTTA CACGAATCGT CCTCCCTACT TGTCTACGCT 3000 

CCTTCATCAA GCACTTGCCA ACACATTCAC CTCTAACTTG TACAACCTTA CCAACTCACC 3060 

ACAACATCTG CAACTCTACC CTATCAACTG CCAACCTAAA GACCCCCAAC ACAACACAAC 3120 

CCCCAAACAC AAAACCACTA AATCATAACC ACCACACACG CCACACACCA CACACCCACC 3180 

CACACAACCA ACACACCACG ACCAAACACC CCACCACAAA CAAGCTAACA ACCACAAACA 3240 
GACAACACAT CACATACACT CACTACCCCC CCATACTCCC ACCCACCA 

Seq ID NO: 234 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 27-281 

1 11 21 31 41 51 

I I I I t I 

AGCAGGAGGA GAGCTGGCGG GAAGACATGC ACCCCTTGAA GACCCAGAGA GAGGCCGTCT 60 

GTCTACCGCG TAGCAGTTAC ATCAGACTGA GACACTTCCT GTTTACAGGA GACTATAAAA 120 

TTCCTGCCCC GTGCTCATTT GGGGCTGACG CCATTTTAGG CCTCAGCCCA TCTGCACCCA 180 

GGCGCTCACT GAAACAGTGT GTTGCTCCAC ACCGCCTTGT TTTGCTTGTT GGCGCGCTCT 240 

CAGGGTTCCG ACCAATCCAA GAGCCTTGCA GAAAGCATTA ACGTGCTTTT CTCTTTGGCA 300 

GAGTTTTTCT TTGCTCTGAT CTTGGAGACA TCCCTCTGCC TAGTGGAAAC ATAAGGAATA 360 

CAGAAAGAAT GCAAGGAGAT AGACCAACGT GAGATTCTCC TTCATGCACT CAAGAGAAAG 420 

ATGTTGCAGG AAGAGCTAGT CTTTCAGGCT GGGCTGGTGA CCTGAGAAAG AATGTCCAGC 480 

TTTTCTTCTC CACTTGGCAT ATCAAGAGCC AGGCGTGGT^ GACTAAAACA GGAAATGTTT 540 

ATAAAAACTG TTCAGCGGTT CGCCAACAAG AAGTGGTAAA GTAGCAAAAA TGGGGATGGA 600 

GATGCCAGGA GGAAAGATGC CAGGGGTAAA GTGGGAAAAT GGGAACCTGA AGCCAGGAGG 660 

TCAAGCCAAG CCAACAGGTG TTCTGTTTTT CATCACAGAA CTAATAAGTG GTGCTGAGGA 720 

CTCAAACCCG GGGAAGCCCA CTCTAGAACC CATGCTGGTC ATCCATATCC CCAAGGCCCT 780 

GGTCAGAACA CAGCTAAGCA GATGGCTTGG GTCATCAGGA CGTCCATTAC ATCCAAAGGA 840 

AGACAGCCTG TGACGTTTCA AAAGCAAAAG TCCCCTACCA GCCAGTGAAG CTACCTGATT 900 

TCTCAGTATC TTACGCCCAG TGACACGATC TACCCTCAAA ACTTAAAAAA AAAAGGGAAA 960 

CATAAACACA TAACAGCAGC AGCAATAATT AAAGATGAGA TGAGAACAAT TAAGAAAAAA 1020 

GGAAAGGTCT CCTGTGACTG TTTTATTTTT AGGGAAACAG AGAGGAAGAA GAATGATTTT 1080 

TCTTTTGATG ACTCTATATC CAACTCTGAG GTTTGATTAA AGAAATGACC TTGAACCACA 1140 

GCA/U^GAAAA ATAAAAGACA ATTTCCAGTA AGTATGCCAG TTCGAATTAA TGATTTACTT 1200 

TTTATTTTTA AACTGAATTC AGCAGAGATT TACATGCATT ACGATGATTA ACATCTGAAA 1260 

TTTGACCTTG AAATAATCTT TACATTGTAA ATTCTTAATG ATCAAAACAA GGTTCTCAGT 1320 

GATTAAAACA TATTAGTAAT TAATTATTAA AGGAGAATAA TTGCAAATAC AACATTCCTA 1380 

AAATCTCAAG GCTTTTAAAG CATTTGTACA AATGACTGGA CATTTTTTAA ATTTGAAAAA 1440 

AAAAT^AAAGC CCTCCATCTG ATTCTCATTT TCATTGTCAG TGCAACAACA AAAAAGGTAT 1500 

GCACTTCTCT TCTCATTTTC CACTGTCTCG CAAGCTAGAA ATTCTCACGA CTACCTTTGA 1560 

TCCCATCAAA GCCAAAGAAA GAAAAGAAAA TTGTTCTGTA CAGATATATG ACATTAAAAA 1620 
ATAATCCC 



Seq ID NO: 235 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I i I 

MHPLKTQREA VCLPRSSYIR LRHFLFTGDY KIPAPCSFGA DAILGLSPSA PRRSLKQCVA 60 
PHRLVLLVGA LSGFRPIQEP CRKH 

Seq ID NO: 236 DNA sequence 

Nucleic Acid Accession #: NM_002075 

Coding sequence: 406..1428 

1 11 21 31 41 51 

I I I I I I 

CCACAATAGG GGCAGACCTG TCCATCCTTC TCTGTGGGTC CCCTGTACCT TTCTCCCCCA 60 

ACAGGATCAG ACCCAGAGGC AGCTGGTTGG GGTTTGTCGA GAAGAAGGAT TATCCAGATC 120 

AGTCCTTTCT AATCTCAGCT CCTGCCTGTA CCCTCCCATA CTCACCAAAC CCTCTTCCCC 180 

ACCACCCTGA GCTGAGGAGC ACAGTTTGAG GCCCCCCCAA CCCCCCGCCG GTCGGGGCCA 240 

GGCCAGGCCA GGCCAGCTCC TCTGGCAGCA GAGCCTGGGC AGGTGACGGG CGGGCGCGGG 300 

CGTCGCAGCT GAGGGAGTAA GGAGGCTCCC AGGAACCGGA GCTGGAAACC CGGCCGAGGT 360 

CCAGCCAGAG CCCAAGAGCC AGAGTGACCC CTCGACCTGT CAGCCATGGG GGAGATGGAG 420 

CAACTGCGTC AGGAAGCGGA GCAGCTCAAG AAGCAGATTG CAGATGCCAG GAAAGCCTGT 480 

GCTGACGTTA CTCTGGCAGA GCTGGTGTCT GGCCTAGAGG TGGTGGGACG AGTCCAGATG 540 

CGGACGCGGC GGACGTTAAG GGGACACCTG GCCAAGATTT ACGCCATGCA CTGGGCCACT 600 

GATTCTAAGC TGCTGGTAAG TGCCTCGCAA GATGGGAAGC TGATCGTGTG GGACAGCTAC 660 

ACCACCAACA AGGTGCACGC CATCCCACTG CGCTCCTCCT GGGTCATGAC CTGTGCCTAT 720 

GCCCCATCAG GGAACTTTGT GGCATGTGGG GGGCTGGACA ACATGTGTTC CATCTACAAC 780 

CTCAAATCCC GTGAGGGCAA TGTCAAGGTC AGCCGGGAGC TTTCTGCTCA CACAGGTTAT 840 

CTCTCCTGCT GCCGCTTCCT GGATGACAAC AATATTGTGA CCAGCTCGGG GGACACCACG 900 

TGTGCCTTGT GGGACATTGA GACTGGGCAG CAGAAGACTG TATTTGTGGG ACACACGGGT 960 

GACTGCATGA GCCTGGCTGT GTCTCCTGAC TTCAATCTCT TCATTTCGGG GGCCTGTGAT 1020 

GCCAGTGCCA AGCTCTGGGA TGTGCGAGAG GGGACCTGCC GTCAGACTTT CACTGGCCAC 1080 
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GAGTCGGACA TCAACGCCAT 
GATGACGCTT CCTGCCGCTT 
CACGAGAGCA TCATCTGCGG 
TTCGCTGGCT ACGACGACTT 
GGCATCCTCT CTGGCCACGA 
GCTGTGGCCA CAGGTTCCTG 
AAGGGAAGTG GAAGGCAGTG 
GGTGTTCTCT TCTATATTCC 
GGGAGCATGG GACTGTGCCT 
CCATCTCCTC CCATGGCCTT 
GGACAACCTG CCCCTCCCCA 
GCCCTAGGAT TCCTCCCCCA 
TTGGCCCTGT GACTATGGCT 
TTCTCCTTTT TCTACCTTTT 
GT 

Seq ID NO: 237 Protein sequence: 
Protein Accession #: NP_002066 

1 11 21 31 41 51 

I I I I I I 

MGEMEQLRQE AEQLKKQIAD ARKACADVTL AELVSGLEVV GRVQMRTRRT LRGHLAKIYA 60 
MHWATDSKLL VSASQDGKLI VWDSYTTNKV HAIPLRSSWV MTCAYAPSGN FVACGGLDNM 120 
CSIYNLKSRE GNVKVSRELS AHTGYLSCCR FLDDNNIVTS SGDTTCALWD IETGQQKTVF 180 
VGHTGDCMSL AVSPDFNLFI SGACDASAKL WDVREGTCRQ TFTGHESDIN AICFFPNGEA 240 
ICTGSDDASC RLFDLRADQE LICFSHESII CGITSVAFSL SGRLLFAGYD DFNCNVWDSM 300 
KSERVGILSG HDNRVSCLGV TADGMAVATG SWDSFLKIWN 

Seq ID NO: 238 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

1 I I I I I 

TCCCAATGTG TNGAACCTAC CATAAATTCT TTTCTTACNG GACAATCTTA TNCTAANCAA 60 

TACCATTTGC TTTTAAGGCA GATAATCCTC CT^GTTTTCT AATGATATCT GAAACTATTA 120 

ACTGATTCTG TGAATTATGA AATCTGAAAA GGAATTGGAA GTTGCTAAAA ATCTATCATT 180 

TGCATTGACC AGTGTGAAGC ACAGTGGAAT GAGAATGCGT GCCCTGACAC CAAAGAAAAA 240 

TAAGTGACTG GAAAGCTGAA GAATCACCGG CTTCAGTGAC ATGGAACCCA GTGATTTGAT 300 

TTTTGACGAG TATCGGGTGA CTTTGAGGTG GTCAAGAAAC CACACTTTAA GAACAATGTC 360 

CAAAAAGGGG AAAAAAAAGA GCAACCAAAG AAAAAAAATC CATAAAATTG CACAGAAGAA 420 

AAGAAAGAAA AATAAAATAC ACAATATGGA CGATGGAGAA AAACAGTTAC ATTTCTTTAT 480 

GGATCAAGAA GTTTGTGTAC ACATAATCTC ATTTTGAGAT ATATAACTAT TTTTGTCTTT 540 

CAGAAGTGAA TCAAAATATT TCAAAATGCT GTCTTATGAA ACTACAATAT TCTCACAGAT 600 

TAGAAAAGTT TTTCTGTAAA AGTCAGATAG TAAATATTTT AGGTTTTGCA GTGTCTTTTG 660 

CAACTACTCA ACTTTCCTAC TGTAGCACAA GAGTAGCTGT GGTACTGTGC AAATAAATTG 720 

CTTGTGTTCC AATAAAGCTT CATTTACAAA AACATGCCAT GGGCCATATT TGGCCTGTAC 780 

ACTGTTGTTT GCCAAGTCCT AATATAGTTG CTTAGCAAGT ATTGTGAGCT ATTTGAGGAA 840 

GACATGAAAG TTCATTGGGT TGCTAAAAAG TATGTAGAAA TTCAAAGGAA AATTAAAATT 900 

TAGGCTAAGT TATAATACAC TGTTTTAACA ATTGTAAAAT GTAAGAGAAA TTTACAAATA 960 
AAAATCCCAA ATAAAA 

Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession #: NM_001786.1 

Coding sequence: 130-1023 

1 11 21 31 41 . 51 

I I I I I I 

GGGGGGGGGG GGCACTTGGC TTCAAAGCTG GCTCTTGGAA ATTGAGCGGA GAGCGACGCG 60 

GTTGTTGTAG CTGCCGCTGC GGCCGCCGCG GAATAATAAG CCGGGATCTA CCATACCCAT 120 

TGACTAACTA TGGAAGATTA TACCAAAATA GAGAAAATTG GAGAAGGTAC CTATGGAGTT 180 

GTGTATAAGG GTAGACACAA AACTACAGGT CAAGTGGTAG CCATGAAAAA AATCAGACTA 240 

GAAAGTGAAG AGGAAGGGGT TCCTAGTACT GCAATTCGGG AAATTTCTCT ATTAAAGGAA 300 

CTTCGTCATC CAAATATAGT CAGTCTTCAG GATGTGCTTA TGCAGGATTC CAGGTTATAT 360 

CTCATCTTTG AGTTTCTTTC CATGGATCTG AAGAAATACT TGGATTCTAT CCCTCCTGGT 420 

CAGTACATGG ATTCTTCACT TGTTAAGAGT TATTTATACC AAATCCTACA GGGGATTGTG 480 

TTTTGTCACT CTAGAAGAGT TCTTCACAGA GACTTAAAAC CTCAAAATCT CTTGATTGAT 540 

GACAAAGGAA CAATTAAACT GGCTGATTTT GGCCTTGCCA GAGCTTTTGG AATACCTATC 600 

AGAGTATATA CACATGAGGT AGTAACACTC TGGTACAGAT CTCCAGAAGT ATTGCTGGGG 660 

TCAGCTCGTT ACTCAACTCC AGTTGACATT TGGAGTATAG GCACCATATT TGCTGAACTA 720 

GCAACTAAGA AACCACTTTT CCATGGGGAT TCAGAAATTG ATCAACTCTT CAGGATTTTC 780 

AGAGCTTTGG GCACTCCCAA TAATGAAGTG TGGCCAGAAG TGGAATCTTT ACAGGACTAT 840 

AAGAATACAT TTCCCAAATG GAAACCAGGA AGCCTAGCAT CCCATGTCAA AAACTTGGAT 900 

GAAAATGGCT TGGATTTGCT CTCGAAAATG TTAATCTATG ATCCAGCCAA ACGAATTTCT 960 

GGCAAAATGG CACTGAATCA TCCATATTTT AATGATTTGG ACAATCAGAT TAAGAAGATG 1020 

TAGCTTTCTG ACAAAAAGTT TCCATATGTT ATGTCAACAG ATAGTTGTGT TTTTATTGTT 1080 

AACTCTTGTC TATTTTTGTC TTATATATAT TTCTTTGTTA TCAAACTTCA GCTGTACTTC 1140 

GTCTTCTAAT TTCAAAAATA TAACTTAAAA ATGTAAATAT TCTATATGAA TTTAAATATA 1200 
ATTCTGTAAA TGTGAAAAAA AAAAAAAAAA AAAAA 



CTGTTTCTTC CCCAATGGAG AGGCCATCTG CACGGGCTCG 1140 

GTTTGACCTG CGGGCAGACC AGGAGCTGAT CTGCTTCTCC 1200 

CATCACGTCC GTGGCCTTCT CCCTCAGTGG CCGCCTACTA 1260 

CAACTGCAAT GTCTGGGACT CCATGAAGTC TGAGCGTGTG 1320 

TAACAGGGTG AGCTGCCTGG GAGTCACAGC TGACGGGATG 1380 

GGACAGCTTC CTCAAAATCT GGAACTGAGG AGGCTGGAGA 1440 

AACACACTCA GCAGCCCCCT GCCCGACCCC ATCTCATTCA 1500 

GGGTGCCATT CCCACTAAGC TTTCTCCTTT GAGGGCAGTG 1560 

TTGGGAGGCA GCATCAGGGA CACAGGGGCA AAGAACTGCC 1620 

CCCTCCCCAC AGTCCTCACA GCCTCTCCCT TAATGAGCAA 1680 

GCCCTTTGCA GGCCCAGCAG ACTTGAGTCT GAGGCCCCAG 1740 

GAGCCACTAC CTTTGTCCAG GCCTGGGTGG TATAGGGCGT 1800 

CTGGCACCAC TAGGGTCCTG GCCCTCTTCT TATTCATGCT 1860 

TTTCTCTCCT AAGACACCTG CAATAAAGTG TAGCACCCTG 1920 



Seq ID NO: 240 Protein sequence: 
Protein Accession #: NP_001777.1 

1 11 21 31 41 51 

i I I I t I 

MEDYTKIEKI GEGTYGVVYK GRHKTTGQVV AMKKIRLESE EEGVPSTAIR EISLLKELRH 60 
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKSYLY QILQGIVFCH 120 






WO 02/086443 

SRRVLHRDLK PQNLLIDDKG TIKLADFGLA RAFGIPIRVY THEVVTLWYR SPEVLLGSAR 180 
YSTPVDIWSI GTIFAELATK KPLFHGDSEI DQLFRIFRAL GTPNNEVWPE VESLQDYKNT 240 
FPKWKPGSLA SHVKNLDENG LDLLSKMLIY DPAKRISGKM ALNHPYFNDL DNQIKKM 
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Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #: NM_033379.1 

Coding sequence: 132-854 



1 11 21 31 41 51 

I I I I I I 

CGCCCGCGCG CGGGCTCAAC TTTGTAGAGC GAGGGGCCAA CTTGGCAGAG CGCGCGGCCA 60 

GCTTTGCAGA GAGCGCCCTC CAGGGACTAT GCGTGCGGGG ACACGGGATC TACCCATACC 120 

ATTGACTAAC TATGGAAGAT TATACCAAAA TAGAGAAAAT TGGAGAAGGT ACCTATGGAG 180 

TTGTGTATAA GGGTAGACAC AAAACTACAG GTCAAGTGGT AGCCATGAAA AAAATCAGAC 240 

TAGAAAGTGA AGAGGAAGGG GTTCCTAGTA CTGCAATTCG GGAAATTTCT CTATTAAAGG 300 

AACTTCGTCA TCCAAATATA GTCAGTCTTC AGGATGTGCT TATGCAGGAT TCCAGGTTAT 360 

ATCTCATCTT TGAGTTTCTT TCCATGGATC TGAAGAAATA CTTGGATTCT ATCCCTCCTG 420 

GTCAGTACAT GGATTCTTCA CTTGTTAAGG TAGTAACACT GTGGTACAGA TCTCCAGAAG 480 

TATTGCTGGG GTCAGCTCGT TACTCAACTC CAGTTGACAT TTGGAGTATA GGCACCATAT 540 

TTGCTGAACT AGCAACTAAG AAACCACTTT TCGATGGGGA TTCAGAAATT GATCAACTCT 600 

TCAGGATTTT CAGAGCTTTG GGCACTCCCA ATAATGAAGT GTGGCCAGAA GTGGAATCTT 660 

TACAGGACTA TAAGAATACA TTTCCCAAAT GGAAACCAGG AAGCCTAGCA TCCCATGTCA 720 

AAAACTTGGA TGAAAATGGC TTGGATTTGC TCTCGAAAAT GTTAATCTAT GATCCAGCCA 780 

AACGAATTTC TGGCAAAATG GCACTGAATC ATCCATATTT TAATGATTTG GACAATCAGA 840 

TTAAGAAGAT GTAGCTTTCT GACAAAAAGT TTCCATATGT TATGTCAACA GATAGTTGTG 900 

TTTTTATTGT TAACTCTTGT CTATTTTTGT CTTATATATA TTTCTTTGTT ATCAAACTTC 960 

AGCTGTACTT CGTCTTCTAA TTTCAAAAAT ATAACTTAAA AATGTAAATA TTCTATATGA 1020 

ATTTAAATAT AATTCTGTAA ATGTGAAAAA AAAAAAAAAA AAAAAA 



Seq ID NO: 242 Protein sequence: 
Protein Accession #: NP_203698.1 



1 11 21 31 41 51 

I I I I I I 

MEDYTKIEKI GEGTYGVVYK GRHKTTGQVV AMKKIRLESE EEGVPSTAIR EISLLKELRH 
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKVVTL WYRSPEVLLG 
SARYSTPVDI WSIGTIFAEL ATKKPLFHGD SEIDQLFRIF RALGTPNNEV WPEVESLQDY 
KNTFPKWKPG SLASHVKNLD ENGLDLLSKM LIYDPAKRIS GKMALNHPYF NDLDNQIKKM 

Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #: AF101051.1 

Coding sequence: 221-856 



GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATTCTATGAC 
TGGCTGGGCT 
CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GTATGGTATT 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 
CCATATTGAT 
CAGTCAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 



11 

I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AGTTTATATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 

GTACAGAATG 



21 

I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 
CCTTCCTGGG 
CCTATGCCGG 
GCGTGTCGCA 
GCAGCACATT 
CAATCTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTGCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 



31 
I 

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACCGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCTC/y^TAT 
ACTCAAATGG 
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTTCATGCGT 
CATCGTTATT 
ACATTTCATA 
TTTGGAGGCA 
ATCCCTGTAC 
AGCTGCATGC 
CTTATTCATA 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TGAGCAAGAT 



41 
I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
CAGATCCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAT^T 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATGCCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CATGACCAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT 
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGT^T ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TT^AAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 

AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 244 Protein sequence: 
Protein Accession #: AAD16433.1 

1 11 21 31 

I I I I 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI 

QIQCKVFDSL LNLSSTLQAT RALMVVGILL GVIAIFVATV 
IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

Seq ID NO: 245 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

I I I I I I 

TTTTTTTTTT TTTTTTTTTT TTTTTCAAGG AGAGCACAAG GAACTTTATT AATGACTTTC 60 

TTAATGGTTA AATGCTGTTT ACCAAGTGAC CCAGAGGCAG CGTGGTTTAG TGGTTTCAAC 120 

AGCATGGTCC CGAGAGTCTG ACAAACCTCA GTTCAAATCC TTCTTTTGTC TTCACTTAGT 180 

TTTTCTTCCT GAGATTTAGT TTCTTCATCG TTAACAATGA GGATATTAAT ATGTTTCACA 240 

CAGTTGTTAT GAAGAATGCA TATATTAGAA TGCCTGTAGT CTCAGCTACT CAGGAGGCTA 300 

AGGTGGGGAG GTCGCTCAAG CCCAGGAATT CAAAGCTGCA ATGCATTATG ATTACAGCTG 360 

TTAATAGCCA CTGCACTTCA GCCTGGGCAA TGTAGTAAGA TCCCATCTCT GGCTCGGAGG 420 

GTCCTACGCC CACGGAGTCT CGCTGATTGC TAGCACAGCA GTCTGAGATC AAACTGCA 

Seq ID NO: 246 DNA sequence 
Nucleic Acid Accession #: XM_058553.2 
Coding sequence: 897-1400 

1 11 21 31 41 51 

I I I I I I 

AATTTTCAGA AGTTTCGTAT GGGGATGGTT TTATATAAAT TCAGGTTTTT CCCACAATAA 60 

TAAATGTATT TAGTCTCAGT GCTCAATAGA AGAGATTTCT AATAGAAAAG GATTCAAACT 120 

GTGAAACCAT TTCTCTTTTA ATGTTTCACA TTCCTGTTAC AGATTTGTTC TCTTGTGACT 180 

CTGTTATCCA TAATATGGAC AGTTCTTGAG TCCTAACATT GAGAGGTTTT CCCTTAGTGC 240 

ATAGAGGGAA TGAGTATTAA TTGGAGAAGC TTAAAGTATT GCCACTTTAG CACTGAAGAT 300 

TGGGATGAGA GGAGGTGAAA CCTCACTAGA AAAAGGGACA ATGTTAGTGT GGCCCTTCCT 360 

GATCATGTTT AAGAAAAGTC ATGAAAATGG TGAACTAGTG TTTCCAAGCA TATTGGAAGG 420 

GTTGAGTGTA TACTGTCTGT CAAAGACTTC CAGCATTTCC AGGTCCTAGA GAGGAACAAG 480 

ACTGGTAACC TGCCTATCTG TATTTTTAAG AACCCAGGAG GAAAGCTTTA TAATAGAACA 540 

TTATTTCTGT GTTTATGTAT AAGGGGTTTT TTGTTTTTTT AAAGACAGGA TCTCACTCCA 600 

TTGTCCAGGC CAAGTGCAAT GGCACGAACC TCATAGCTCC TGGACTTAAG TGATCTGCCT 660 

GCCTTTGCCT CCTGAGTAGC TGGGACTACA GGCATGAGCC CCCATGCCTG GCTAAGTTTG 720 

TTTTTTTGTT TGTTTGTTTG TTTGTTTTTG GGGGGGGTTG TTTTGTTTTT TGTAGAGACG 780 

TAGTCTTGCT TTGTTGCCAG GCTAGTCTCA AACTCCTGGC TTCAAGTGAT CCTCCTGCCT 840 

CAGCCTCCCA GAGTGCTAGG ATTACAGCAC TTGGATTCAG CTTCTTCATT TCCAACATGG 900 

AAGAAACTTA CACCGACTCC CTGGACCCTG AGAAGCTATT GCAATGCCCC TATGACAAAA 960 

ACCATCAAAT CAGGGCTTGC AGGTTTCCTT ATCATCTTAT CAAGTGCAGA AAGAATCATC 1020 

CTGATGTTGC AAGCAAATTG GCTACTTGTC CCTTCAATGC TCGCCACCAG GTTCCTCGAG 1080 

CTGAAATTAG TCATCATATC TCAAGCTGTG ATGACAGTUiQ TTGTATTGAG CAAGATGTTG 1140 

TCAACCAAAC CAGGAGCCTT AGACAAGAGA CTCTGGCTGA GAGCACTTGG CAGTGCCCTC 1200 

CTTGCGATGA AGACTGGGAT AAAGATTTGT GGGAGCAGAC CAGCACCCCA TTTGTCTGGG 1260 

GCACAACTCA CTACTCTGAC AACAACAGCC CTGCGAGCAA CATAGTTACA GAACATAAGA 1320 

ATAACCTGGC TTCAGGCATG CGAGTTCCCA AATCTCTGCC GTATGTTCTG CCATGGAAAA 1380 

ACAATGGAAA TGCACAGTAA CTGAATACCT ATCTCATCAA ATGCCAGACC CTAGAAGACT 1440 

GTTGCTTCTT CTTCTACCAG TGGGTTCTCA TTTTCCTCCT AATCTAATTA TAGAATGGTA 1500 

AACTCCCTGT GACTTTCCAA ACTGACAAGC ACACTTTTTT CCTCCCCCCT TGAATCCTCA 1560 

TTTAATGCAA GAACCCTCAT ACTCAGAAGC TTCCAAATAA ACCTTTGATA CAGATTG 



Seq ID NO: 247 Protein sequence: 
Protein Accession #: XP_058553.1 

1 11 21 31 41 

I I I I I 

MEETYTDSLD PEKLLQCPYD KNHQIRACRF PYHLIKCRKN HPDVASKLAT 
RAEISHHISS CDDRSCIEQD VVNQTRSLRQ ETLAESTWQC PPCDEDWDKD 

WGTTHYSDNN SPASNIVTEH KNNLASGMRV PKSLPYVLPW KNNGNAQ 



41 51 

1 I 

VTAQAMYEGL WMSCVSQSTG 60 
GMKCMKCLED DEVQKMRMAV 120 
FGQALFTGWA AASLCLLGGA 180 



CPFNARHQVP 60 
LWEQTSTPFV 120 



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Seq ID NO: 248 DNA sequence 
Nucleic Acid Accession #: NM_003392 
Coding sequence: 758..1855 



PCT/US02/12476 



1 11 21 31 41 51 



TTAAGGAAAT CCGGGCTGCT CTTCCCCATC TGGAAGTGGC TTTCCCCACA TCGGCTCGTA 60 

AACTGATTAT GAAACATACG ATGTTAATTC GGAGCTGCAT TTCCCAGCTG GGCACTCTCG 120 

CGCGCTGGTC CCCGGGGCCT CGCCCCCCAC CCCCTGCCCT TCCCTCCCGC GTCCTGCCCC 180 

CATCCTCCAC CCCCCGCGCT GGCCACCCCG CCTCCTTGGC AGCCTCTGGC GGCAGCGCGC 240 

TCCACTCGCC TCCCGTGCTC CTCTCGCCCA TGGAATTAAT TCTGGCTCCA CTTGTTGCTC 300 
GGCCCAGGTT GGGGAGAGGA CGGAGGGTGG CCGCAGCGGG TTCCTGAGTG AATTACCCAG 360 
GAGGGACTGA GCACAGCACC AACTAGAGAG GGGTCAGGGG GTGCGGGACT CX5AGCGAGCA 420 
GGAAGGAGGC AGCGCCTGGC ACCAGGGCTT TGACTCAACA GAATTGAGAC ACGTTTGTAA 480 
TCGCTGGCGT GCCCCGCGCA CAGGATCCCA GCGAAAATCA GATTTCCTGG TGAGGTTGCG 540 
TGGGTGGATT AATTTGGAAA AAGAAACTGC CTATATCTTG CCATCAAAAA ACTCACGGAG 600 
GAGAAGCGCA GTCAATCAAC AGTAAACTTA AGAGACCCCC GATGCTCCCC TGGTTTAACT 660 
TGTATGCTTG AAAATTATCT GAGAGGGAAT AAACATCTTT TCCTTCTTCC CTCTCCAGAA 720 
GTCCATTGGA ATATTAAGCC CAGGAGTTGC TTTGGGGATG GCTGGAAGTG CAATGTCTTC 780 
CAAGTTCTTC CTAGTGGCTT TGGCCATATT TTTCTCCTTC GCCCAGGTTG TAATTGAAGC 840 
CAATTCTTGG TGGTCGCTAG GTATGAATAA CCCTGTTCAG ATGTCAGAAG TATATATTAT 900 
AGGAGCACAG CCTCTCTGCA GCCAACTGGC AGGACTTTCT CAAGGACAGA AGAAACTGTG 960 

CCACTTGTAT CAGGACCACA TGCAGTACAT CGGAGAAGGC GCGAAGACAG GCATCAAAGA 1020 

ATGCCAGTAT CAATTCCGAC ATCGACGGTG GAACTGCAGC ACTGTGGATA ACACCTCTGT 1080 

TTTTGGCAGG GTGATGCAGA TAGGCAGCCG CGAGACGGCC TTCACATACG CCGTGAGCGC 1140 

AGCAGGGGTG GTGAACGCCA TGAGCCGGGC GTGCCGCGAG GGCGAGCTGT CCACCTGCGG 1200 

CTGCAGCCGC GCCGCGCGCC CCAAGGACCT GCCGCGGGAC TGGCTCTGGG GCGGCTGCGG 1260 

CGACAACATC GACTATGGCT ACCGCTTTGC CAAGGAGTTC GTGGACGCCC GCGAGCGGGA 1320 

GCGCATCCAC GCCAAGGGCT CCTACGAGAG TGCTCGCATC CTCATGAACC TGCACAACAA 1380 

CGAGGCCGGC CGCAGGACGG TGTACAACCT GGCTGATGTG GCCTGCAAGT GCCATGGGGT 1440 

GTCCGGCTCA TGTAGCCTGA AGACATGCTG GCTGCAGCTG GCAGACTTCC GCAAGGTGGG 1500 

TGATGCCCTG AAGGAGAAGT ACGACAGCGC GGCGGCCATG CGGCTCAACA GCCGGGGCAA 1560 

GTTGGTACAG GTCAACAGCC GCTTCAACTC GCCCACCACA CAAGACCTGG TCTACATCGA 1620 

CCCCAGCCCT GACTACTGCG TGCGCAATGA GAGCACCGGC TCGCTGGGCA CGCAGGGCCG 1680 

CCTGTGCAAC AAGACGTCGG AGGGCATGGA TGGCTGCGAG CTCATGTGCT GCGGCCGTGG 1740 

GTACGACCAG TTCAAGACCG TGCAGACGGA GCGCTGCCAC TGCAAGTTCC ACTGGTGCTG 1800 

CTACGTCAAG TGCAAGAAGT GCACGGAGAT CGTGGACCAG TTTGTGTGCA AGTAGTGGGT 1860 

GCCACCCAGC ACTCAGCCCC GCTCCCAGGA CCCGCTTATT TATAGAAAGT ACAGTGATTC 1920 

TGGTTTTTGG TTTTTAGAAA TATTTTTTAT TTTTCCCCAA GAATTGCAAC CGGAACCATT 1980 

TTTTTTCCTG TTACCATCTA AGAACTCTGT GGTTTATTAT TAATATTATA ATTATTATTT 2040 

GGCAATAATG GGGGTGGGAA CCACGAAAAA TATTTATTTT GTGGATCTTT GAAAAGGTAA 2100 

TACAAGACTT CTTTTGGATA GTATAGAATG AAGGGGGAAA TAACACATAC CCTAACTTAG 2160 

CTGTGTGGGA CATGGTACAC ATCCAGAAGG TAAAGAAATA CATTTTCTTT TTCTCAAATA 2220 

TGCCATCATA TGGGATGGGT AGGTTCCAGT TGAAAGAGGG TGGTAGAAAT CTATTCACAA 2280 

TTCAGCTTCT ATGACCAAAA TGAGTTGTAA ATTCTCTGGT GCAAGATAAA AGGTCTTGGG 2340 

AAAACAAAAC AAAACAAAAC AAACCTCCCT TCCCCAGCAG GGCTGCTAGC TTGCTTTCTG 2400 

CATTTTCAAA ATGATAATTT ACAATGGAAG GACAAGAATG TCATATTCTC AAGGAAAAAA 2460 

GGTATATCAC ATGTCTCATT CTCCTCAAAT ATTCCATTTG CAGACAGACC GTCATATTCT 2520 

AATAGCTCAT GAAATTTGGG CAGCAGGGAG GAAAGTCCCC AGAAATTAAA AAATTTAAAA 2580 

CTCTTATGTC AAGATGTTGA TTTGT^GCTG TTATAAGAAT TGGGATTCCA GATTTGTAAA 2640 

AAGACCCCCA ATGATTCTGG ACACTAGATT TTTTGTTTGG GGAGGTTGGC TTGAACATAA 2700 

ATGAAATATC CTGTATTTTC TTAGGGATAC TTGGTTAGTA AATTATAATA GTAGAAATAA 2760 

TACATGAATC CCATTCACAG GTTTCTCAGC CCAAGCAACA AGGTAATTGC GTGCCATTCA 2820 

GCACTGCACC AGAGCAGACA ACCTATTTGA GGAAAAACAG TGAAATCCAC CTTCCTCTTC 2880 

ACACTGAGCC CTCTCTGATT CCTCCGTGTT GTGATGTGAT GCTGGCCACG TTTCC7^AACX3 2940 

GCAGCTCCAC TGGGTCCCCT TTGGTTGTAG GACAGGAAAT GAAACATTAG GAGCTCTGCT 3000 

TGGAAAACAG TTCACTACTT AGGGATTTTT GTTTCCTAAA ACTTTTATTT TGAGGAGCAG 3060 

TAGTTTTCTA TGTTTTAATG ACAGAACTTG GCTAATGGAA TTCACAGAGG TGTTGCAGCG 3120 

TATCACTGTT ATGATCCTGT GTTTAGATTA TCCACTCATG CTTCTCCTAT TGTACTGCAG 3180 

GTGTACCTTA AAACTGTTCC CAGTGTACTT GAACAGTTGC ATTTATAAGG GGGGAAATGT 3240 

GGTTTAATGG TGCCTGATAT CTCAAAGTCT TTTGTACATA ACATATATAT ATATATACAT 3300 

ATATATAAAT ATAAATATAA ATATATCTCA TTGCAGCCAG TGATTTAGAT TTACAGCTTA 3360 

CTCTGGGGTT ATCTCTCTGT CTAGAGCATT GTTGTCCTTC ACTGCAGTCC AGTTGGGATT 3420 

ATTCCAAAAG TTTTTTGAGT CTTGAGCTTG GGCTGTGGCC CCGCTGTGAT CATACCCTGA 3480 

GCACGACGAA GCAACCTCGT TTCTGAGGAA GAAGCTTGAG TTCTGACTCA CTGAAATGCG 3540 

TGTTGGGTTG AAGATATCTT TTTTTCTTTT CTGCCTCACC CCTTTGTCTC CAACCTCCAT 3600 

TTCTGTTCAC TTTGTGGAGA GGGCATTACT TGTTCGTTAT AGACATGGAC GTTAAGAGAT 3660 

ATTCAAAACT CAGAAGCATC AGCAATGTTT CTCTTTTCTT AGTTCATTCT GCAGAATGGA 3720 

AACCCATGCC TATTAGAAAT GACAGTACTT ATTAATTGAG TCCCTAAGGA ATATTCAGCC 3780 

CACTACATAG ATAGCTTTTT TTTTTTTTTT TTTTTTTTAA TAAGGACACC TCTTTCCAAA 3840 

CAGGCCATCA AATATGTTCT TATCTCAGAC TTACGTTGTT TTAAAAGTTT GGAAAGATAC 3900 

ACATCTTTTC ATACCCCCCC TTAGGAGGTT GGGCTTTCAT ATCACCTCAG CCAACTGTGG 3960 

CTCTTAATTT ATTGCATAAT GATATCCACA TCAGCCAACT GTGGCTCTTT AATTTATTGC 4020 

ATAATGATAT TCACATCCCC TCAGTTGCAG TGAATTGTGA GCAAAAGATC TTGAAAGCAA 4080 

AAAGCACTAA TTAGTTTAAA ATGTCACTTT TTTGGTTTTT ATTATACAAA AACCATGAAG 4140 

TACTTTTTTT ATTTGCTAAA TCAGATTGTT CCTTTTTAGT GACTCATGTT TATGAAGAGA 4200 

GTTGAGTTTA ACAATCCTAG CTTTTAAAAG AAACTATTTA ATGTAAAATA TTCTACATGT 4260 
CATTCAGATA TTATGTATAT CTTCTAGCCT TTATTCTGTA CTTTTAATGT ACATATTTCT 4320 
GTCTTGCGTG ATTTGTATAT TTCACTGGTT TAAAAAACAA ACATCGAAAG <3CTTATTCCA 4380 
AATGGAAGAT AGAATATAAA ATAAAACGTT ACTTGTAAAA AAATiAAAA 
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Protein Accession #: NP_003383 

1 11 21 31 41 51 


MAGSAMSSKF FLVALAIFFS FAQVVIEANS WWSLGMNNPV QMSEVYIIGA QPLCSQLAGL 60 
SQGQKKLCHL YQDHMQYIGE GAKTGIKECQ YQFRHRRWNC STVDNTSVFG RVMQIGSRET 120 
AFTYAVSAAG VVNAMSRACR EGELSTCGCS RAARPKDLPR DWLWGGCGDN IDYGYRFAKE 180 
FVDARERERI HAKGSYESAR ILMNLHNNEA GRRTVYNLAD VACKCHGVSG SCSLKTCWLQ 240 
LADFRKVGDA LKEKYDSAAA MRLNSRGKLV QVNSRFNSPT TQDLVYIDPS PDYCVRNEST 300 
GSLGTQGRLC NKTSEGMDGC ELMCCGRGYD QFKTVQTERC HCKFHWCCYV KCKKCTEIVD 360 
QFVCK 

Seq ID NO: 250 DNA sequence 
Nucleic Acid Accession #: NM_014058 
Coding sequence: 56..1324 






TGACTTGGAT 
TCGGCCAGAT 
CGTCATCTTC 
GAGATATAAT 
ACTATATGCT 
TGAATCAATG 
TCAGGTTATC 
TAGATTTCAC 
TGAAAAGCTG 
AAAAATCAAC 
TAAAACTCTA 
GCCCTGGCAG 
TGCCACATGG 
GACTGCTTCC 
AATTGTCCAT 
TTCTAGCCCT 
TGAGTTTCAA 
TTACAGTCAA 
TGAACCTCAA 
AGGAATU^CA 
AGATATCTGG 
GCCTGGTGTT 
CTAAGAGAGA 
CCATTTTTAG 
AATAAACTGT 



11 
1 

GTAGACCTCG 
GTGGTGAGGG 
ATATCCCTGA 
CAAAAGAAGA 
GAGTTTGGCA 
GTGAAAAATG 
AAGTTCAGTC 
TCTACTGAGG 
CAAGATGCTG 
AAGACAGAAA 
GGTCAGAGTC 
GCTAGCCTGC 
CTTGTGAGTG 
TTTGGAGTAA 
GAAAAATACA 
GTTCCCTACA 
CCAGGTGATG 
AATCATCTTC 
GCTTACAATG 
GATGCATGCC 
TACCTTGCTG 
TATACTAGAG 
AAAGCCTCAT 
AGATACAGAA 
TTGCTTGATG 



21 
I 

ACCTTCACAG 
CTAGGAAT^G 
TTGTCCTGGC 
CCTACAATTA 
GAGAGGCTTC 
CATTTTATAA 
AACAGAAGCA 
ATCCTGAAAC 
TAGGACCCCC 
CAGACAGCTA 
TCAGGATCGT 
AGTGGGATGG 
CTGCTCACTG 
CAATAAAACC 
AACACCCATC 
CAAATGCAGT 
TGATGTTTGT 
GACAAGCACA 
ACGCCATAAC 
AGGGTGACTC 
GAATAGTGAG 
TTACGGCCTT 
GGAACAGATA 
TTGGAGiU^GA 
CAAAAAAAAA 



31 
1 

GACTCTTCAT 
AGTTTGTTGG 
AGTGTGCATT 
CTATAGCACA 
TAACAATTTT 
ATCTCCATTA 
TGGAGTGTTG 
TGTAGATAAA 
TAAAGTAGAT 
TCTAAACCAT 
TGGTGGGACA 
GAGTCATCGC 
TTTTACAACA 
TTCGAAAATG 
ACATGACTAT 
ACATAGAGTT 
GACAGGATTT 
GGTGACTCTC 
TCCTAGAATG 
TGGAGGACCA 
CTGGGGAGAT 
GCGGGACTGG 
ACATTTTTTT 
CTTGCAAAAC 
A 



Seq ID NO: 251 Protein sequence: 
Protein Accession #: NP_054777 



MYRPDWRAR 
DKLYAEFGRE 
ICRFHSTEDP 
RSKTLGQSLR 
RWTASFGVTI 
SYEFQPGDVM 
LEGKTDACQG 
GI 



11 
1 

KRVCWEPWVI 
ASNNFTEMSQ 
ETVDKIVQLV 
IVGGTEVEEG 
KPSKMKRGLR 
FVTGFGALKN 
DSGGPLVSSD 



21 
1 

GLVIFISLIV 
RLESMVKNAF 
LHEKLQDAVG 
EWPWQASLQW 
RIIVHEKYKH 
DGYSQNHLRQ 
ARDIWYLAGI 



31 

I 

LAVCIGLTVH 
YKSPLREEFV 
PPKVDPHSVK 
DGSHRCGATL 
PSHDYDISLA 
AQVTLIDATT 
VSWGDKCAKP 



41 
I 

TGCTGGTTGG 
GAACCCTGGG 
GGACTCACTG 
TTGTCATTTA 
ACAGAAATGA 
AGGGAAGAAT 
GCTCATATGC 
ATTGTTCAAC 
CCTCACTCAG 
TGCTGCGGAA 
GAAGTAGAAG 
TGTGGAGCAA 
TATT^GAACC 
AAACGGGGTC 
GATATTTCTC 
TGTCTCCCTG 
GGAGCACTGA 
ATAGACGCTA 
TTATGTGCTG 
CTGGTTAGTT 
GZUVTGTGCGA 
ATTACTTCAA 

AGCTAGATTT 



41 

YVRYNQKKTY 
KSQVIKFSQQ 
IKKINKTETD 
INATWLVSAA 
ELSSPVPYTN 
CNEPQAYNDA 
NKPGVYTRVT 



51 
I 

CAATGATGTA 
TTATCGGCCT 
TTCATTATGT 
CAACTGACAA 
GCCAGAGACT 
TTGTCAAGTC 
TGTTGATTTG 
TTGTTTTACA 
TTAAAATTAA 
CACGAAGAAG 
AGGGTGAATG 
CCTTAATTAA 
CTGCCAGATG 
TCCGGAGAAT 
TTGCAGAGCT 
ATGCATCCTA 
AAAATGATGG 
CAACTTGCAA 
GCTCCTTAGA 
CAGATGCTAG 
AACCCAACAA 
AAACTGGTAT 
GGTGTGGAGG 
GACTGATCTC 



51 
i 

NYYSTLSFTT 
KHGVLAHMLL 
SYLNHCCGTR 
HCFTTYKNPA 
AVHRVCLPDA 
ITPRMLCAGS 
ALRDWITSKT 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: NM_003504.2 

Coding sequence: 71-1771 



1 11 21 31 41 51

| | | | | |

GGCACGAGGC CTCGTGCCGC CGGGCTCTTG GTACCTCAGC GCGAGCGCCA GGCGTCCGGC 60
CGCCGTGGCT ATGTTCGTGT CCGATTTCCG CAAAGAGTTC TACGAGGTGG TCCAGAGCCA 120
GAGGGTCCTT CTCTTCGTGG CCTCGGACGT GGATGCTCTG TGTGCGTGCA AGATCCTTCA 180
GGCCTTGTTC CAGTGTGACC ACGTGCAATA TACGCTGGTT CCAGTTTCTG GGTGGCAAGA 240
ACTTGAAACT GCATTTCTTG AGCATAAAGA ACAGTTTCAT TATTTTATTC TCATAAACTG 300
TGGAGCTAAT GTAGACCTAT TGGATATTCT TCAACCTGAT GAAGACACTA TATTCTTTGT 360
GTGTGACACC CATAGGCCAG TCAATGTCGT CAATGTATAC AACGATACCC AGATCAAATT 420
ACTCATTAAA CAAGATGATG ACCTTGAAGT TCCCGCCTAT GAAGACATCT TCAGGGATGA 480
AGAGGAGGAT GAAGAGCATT CAGGAAATGA CAGTGATGGG TCAGAGCCTT CTGAGAAGCG 540
CACACGGTTA GAAGAGGAGA TAGTGGAGCA AACCATGCGG AGGAGGCAGC GGCGAGAGTG 600
GGAGGCCCGG AGAAGAGACA TCCTCTTTGA CTACGAGCAG TATGAATATC ATGGGACATC 660
GTCAGCCATG GTGATGTTTG AGCTGGCTTG GATGCTGTCC AAGGACCTGA ATGACATGCT 720
GTGGTGGGCC ATCGTTGGAC TAACAGACCA GTGGGTGCAA GACAAGATCA CTCAAATGAA 780
ATACGTGACT GATGTTGGTG TCCTGCAGCG CCACGTTTCC CGCCACAACC ACCGGAACGA 840
GGATGAGGAG AACACACTCT CCGTGGACTG CACACGGATC TCCTTTGAGT ATGACCTCCG 900
CCTGGTGCTC TACCAGCACT GGTCCCTCCA TGACAGCCTG TGCAACACCA GCTATACCGC 960
AGCCAGGTTC AAGCTGTGGT CTGTGCATGG ACAGAAGCGG CTCCAGGAGT TCCTTGCAGA 1020
CATGGGTCTT CCCCTGAAGC AGGTGAAGCA GAAGTTCCAG GCCATGGACA TCTCCTTGRA 1080
GGAGAATTTG CGGGAAATGA TTGAAGAGTC TGCAAATAAA TTTGGGATGA AGGACATGCG 1140
CGTGCAGACT TTCAGCATTC ATTTTGGGTT CAAGCACAAG TTTCTGGCCA GCGACGTGGT 1200
CTTTGCCACC ATGTCTTTGA TGGAGAGCCC CGAGAAGGAT GGCTCAGGGA CAGATCACTT 1260
CATCCAGGCT CTGGACAGCC TCTCCAGGAG TAACCTGGAC AAGCTGTACC ATGGCCTGGA 1320
ACTCGCCAAG AAGCAGCTGC GAGCCACCCA GCAGACCATT GCCAGCTGCC TTTGCACCAA 1380
CCTCGTCATC TCCCAGGGGC CTTTCCTGTA CTGCTCTCTC ATGGAGGGCA CTCCAGATGT 1440
CATGCTGTTC TCTAGGCCGG CATCCCTAAG CCTGCTCAGC AAACACCTGC TCAAGTCCTT 1500
TGTGTGTTCG ACAAAGAACC GGCGCTGCAA ACTGCTGCCC CTGGTGATGG CTGCCCCCCT 1560



WO 02/086443

GAGCATGGAG CATGGCACAG TGACCGTGGT GGGCATCCCC CCAGAGACCG ACAGCTCGGA 1620 

CAGGAAGAAC TTTTTTGGGA GGGCGTTTGA GAAGGCAGCG GAAAGCACCA GCTCCCGGAT 1680

GCTGCACAAC CATTTTGACC TCTCAGTAAT TGAGCTGAAA GCTGAGGATC GGAGCAAGTT 1740 

TCTGGACGCA CTTATTTCCC TCCTGTCCTA GGAATTTGAT TCTTCCAGAA TGACCTTCTT 1800 

ATTTATGTAA CTGGCTTTCA TTTAGATTGT AAGTTATGGA CATGATTTGA GATGTAGAAG 1860 

CCATTTTTTA TTA7ATAA7A TGCTTATTTT AGGCTCCGTC CCCAAAAAAA AAAAAAAAAA 1920 
AAAAAT^AAAA AA 



PCT/US02/12476 



Seq ID NO: 253 Protein sequence:
Protein Accession #: NP_003495.1

1 11 21 31 41 51

| | | | | |

MFVSDFRKEF YEVVQSQRVL LFVASDVDAL CACKILQALF QCDHVQYTLV PVSGWQELET 60
AFLEHKEQFH YFILINCGAN VDLLDILQPD EDTIFFVCDT HRPVNVVNVY NDTQIKLLIK 120
QDDDLEVPAY EDIFRDEEED EEHSGNDSDG SEPSEKRTRL EEEIVEQTMR RRQRREWEAR 180
RRDILFDYEQ YEYHGTSSAM VMFELAWMLS KDLNDMLWWA IVGLTDQWVQ DKITQMKYVT 240
DVGVLQRHVS RHNHRNEDEE NTLSVDCTRI SFEYDLRLVL YQHWSLHDSL CNTSYTAARF 300
KLWSVHGQKR LQEFLADMGL PLKQVKQKFQ AMDISLKENL REMIEESANK FGMKDMRVQT 360
FSIHFGFKHK FLASDVVFAT MSLMESPEKD GSGTDHFIQA LDSLSRSNLD KLYHGLELAK 420
KQLRATQQTI ASCLCTNLVI SQGPFLYCSL MEGTPDVMLF SRPASLSLLS KHLLKSFVCS 480
TKNRRCKLLP LVMAAPLSME HGTVTVVGIP PETDSSDRKN FFGRAFEKAA ESTSSRMLHN 540
HFDLSVIELK AEDRSKFLDA LISLLS



Seq ID NO: 254 DNA sequence 
Nucleic Acid Accession #: NM_022337 
Coding sequence: 48.. 683 



1 11 21 31 41 51

| | | | | |

GGCTGCGCTT CCCTGGTCAG GCACGGCACG TCTGGCCGGC CGCCAGGATG CAGGCCCCGC 60
ACAAGGAGCA CCTGTACAAG TTGCTGGTGA TTGGCGACCT GGGCGTGGGG AAGACCAGTA 120
TCATCAAGCG CTACGTGCAC CAGAACTTCT CCTCGCACTA CCGGGCCACA ATCGGCGTGG 180
ACTTCGCGCT CAAGGTGCTC CACTGGGACC CGGAGACTGT GGTGCGCCTG CAGCTCTGGG 240
ATATCGCAGG TCAAGAAAGA TTTGGAAACA TGACGAGGGT CTATTACCGA GAAGCTATGG 300
GTGCATTTAT TGTCTTCGAT GTCACCAGGC CAGCCACATT TGAAGCAGTG GCAAAGTGGA 360
AAAATGATTT GGACTCCAAG TTAAGTCTCC CTAATGGCAA ACCGGTTTCA GTGGTTTTGT 420
TGGCCAACAA ATGTGACCAG GGGAAGGATG TGCTCATGAA CAATGGCCTC AAGATGGACC 480
AGTTCTGCAA GGAGCACGGT TTCGTAGGAT GGTTTGAAAC ATCAGCAAAG GAAAATATAA 540
ACATTGATGA AGCCTCCAGA TGCCTGGTGA AACACATACT TGCAAATGAG TGTGACCTAA 600
TGGAGTCTAT TGAGCCGGAC GTCGTGAAGC CCCATCTCAC ATCAACCAAG GTTGCCAGCT 660
GCTCTGGCTG TGCCAAATCC TAGTAGGCAC CTTTGCTGGT GTCTGGTAGG AATGACCTCA 720
TTGTTCCACA AATTGTGCCT CTATTTTTAC CATTTTGGGT AAACGTCAGG ATAGATATAC 780
CACATGTGGC AAGCCAAAGA TCTATGCCTC TGTTTTTTCA ATGAGAGAGA AATAGCAAAT 840
GTTCTTTCTA TGCTTTCCTC ACCATCATCA CAGTGTTTAC AAACTTTTGA AAATATTTAG 900
TCTGTTACAA ACTTCTGTCA TGTAGCTGAC CAAAATCCTG CAGGGCCACA GTCGGCACTG 960
TTATTTGCTT CTTTTAATCA GCAAAGGCCT CAAGTCTTAA AATAAAAGGG GAGAAGAACA 1020
AACTAGCTGT CAAGTCAAGG ACTGGCTTTC ACCTTGCCCT GGTGTCTTTT TCCAGATTTC 1080
AATATATTCT CTGATGGCCT GACAGGCCTA TTAAGTAGAT GTGATATTTT CTTCCAAGAT 1140
GACCTCCATT CTCGGCAGAC CTAAGAGTTG CCTCTGAGTT AGCTCTTTGG AATCGTGAAC 1200
ACAGGTGTGC TATATTGTCC TTGTCCTAAC TGTCACTTGC CATGGCCTGA ATGTTGGCTT 1260
AACTGAATAT TGTATGAAAA GACATGCCTC CATATGTGCC TTTCTGTTAG CTCTCTTTGA 1320
CTCAAGCTGT GGGGCTCCTC TATACATGCT ATACATGTAA TATATATTAT ATATATTTTT 1380
GCAAGTGAAC AATATU^CAT TAAAAGATJ^A



Seq ID NO: 255 Protein sequence: 
Protein Accession #: NP_071732 



1 11 21 31 41 51

| | | | | |

MQAPHKEHLY KLLVIGDLGV GKTSIIKRYV HQNFSSHYRA TIGVDFALKV LHWDPETVVR 60
LQLWDIAGQE RFGNMTRVYY REAMGAFIVF DVTRPATFEA VAKWKNDLDS KLSLPNGKPV 120
SVVLLANKCD QGKDVLMNNG LKMDQFCKEH GFVGWFETSA KENINIDEAS RCLVKHILAN 180
ECDLMESIEP DVVKPHLTST KVASCSGCAK S



Seq ID NO: 256 DNA sequence 
Nucleic Acid Accession #: NM_016321 
Coding sequence: 25.. 1464 



1 11 21 31 41 51

| | | | | |

GGAACCGCCC GCTGCCAGCC CGGCCAGGCA CCCCTGCAGC ATGGCCTGGA ACACCAACCT 60
CCGCTGGCGG CTGCCGCTCA CCTGCCTGCT CCTGCAGGTG ATTATGGTGA TTCTCTTCGG 120
GGTGTTCGTG CGCTACGACT TCGAGGCCGA CGCCCACTGG TGGTCAGAGA GGACGCACAA 180
GAACTTGAGC GACATGGAGA ACGAATTCTA CTATCGCTAC CCAAGCTTCC AGGACGTGCA 240
CGTGATGGTC TTCGTGGGCT TCGGCTTCCT CATGACTTTC CTGCAGCGCT ACGGCTTCAG 300
CGCCGTGGGC TTCAACTTCC TGTTGGCAGC CTTCGGCATC CAGTGGGCGC TGCTCATGCA 360
GGGCTGGTTC CACTTCTTAC AAGACCGCTA CATCGTCGTG GGCGTGGAGA ACCTCATCAA 420
CGCTGACTTC TGCGTGGCCT CTGTCTGCGT GGCCTTTGGG GCAGTTCTGG GTAAAGTCAG 480
CCCCATTCAG CTGCTCATCA TGACTTTCTT CCAAGTGACC CTCTTCGCTG TGAATGAGTT 540
CATTCTCCTT AACCTGCTAA AGGTGAAGGA TGCAGGAGGC TCCATGACCA TCCACACATT 600
TGGCGCCTAC TTTGGGCTCA CAGTGACCCG GATCCTCTAC CGACGCAACC TAGAGCAGAG 660
CAAGGAGAGA CAGAATTCTG TGTACCAGTC GGACCTCTTT GCCATGATTG GCACCCTCTT 720
CCTGTGGATG TACTGGCCCA GCTTCAACTC AGCCATATCC TACCATGGGG ACAGCCAGCA 780
CCGAGCCGCC ATCAACACCT ACTGCTCCTT GGCAGCCTGC GTGCTTACCT CGGTGGCAAT 840

ATCCAGTGCC CTGCACAAGA AGGGCAAGCT GGACATGGTG CACATCCAGA ATGCCACGCT 900 

CGCAGGAGGG GTGGCCGTGG GTACCGCTGC TGAGATGATG CTCATGCCTT ACGGTGCCCT 960 

CATCATCGGC TTCGTCTGCG GCATCATCTC CACCCTGGGT TTTGTATACC TGACCCCATT 1020 

CCTGGAGTCC CGGCTGCACA TCCAGGACAC ATGTGGCATT AACAATCTGC ATGGCATTCC 1080 

TGGCATCATA GGCGGCATCG TGGGTGCTGT GACAGCGGCC TCCGCCAGCC TTGAAGTCTA 1140 

TGGAAAAGAA GGGCTTGTCC ATTCCTTTGA CTTTCAAGGT TTCAACGGGG ACTGGACCGC 1200 

AAGAACACAG GGAAAGTTCC AGATTTATGG TCTCTTGGTG ACCCTGGCCA TGGCCCTGAT 1260 

GGGTGGCATC ATTGTGGGGC TCATTTTGAG ATTACCATTC TGGGGACAAC CTTCAGATGA 1320 

GAACTGCTTT GAGGATGCGG TCTACTGGGA GATGCCTGAA GGGAACAGCA CTGTCTACAT 1380

CCCTGAGGAC CCCACCTTCA AGCCCTCAGG ACCCTCAGTA CCCTCAGTAC CCATGGTGTC 1440 

CCCACTACCC ATGGCTTCCT CGGTACCCTT GGTACCCTAG GCTCCCAGGG CAGGTGAGGA 1500 

GCAGGCTCCA CAGACTSTCC TGGGGCCCAG AGGAGCTGGT GCTGACCTAG CTAGGGATGC 1560 

AAGAGTGAGC AAGCAGCACC CCCACCTGCT GGCTTGGCCT CAAGGTGCCT CCACCCCTGC 1620 

CCTCCCCTTC ATCCCAGGGG GTCTGMCTGA GAATGGAGAA GGAGAAGCTA CAAAGTGGGC 1680 

ATCCAAGCCG GGTTCTGGCT GCAGAAGTTC TGCCTCTGCC TGGGGTCTTG GCCACATTGG 1740 

AGAAAAACAG GCTCAAAGTG GGGCTGGGAC CTGGTGGGTG AACCTGAGCT CTCCCAGGAG 1800 

ACAACTTAGC TGCCAGTCAC CACGTATGAG GCTCTTCTAC CCCGTGCCTG CACCTCGGCC 1860 

AGCATCTCCT ATGCTCCCTG GGTCCCCCAG ACCTCTCTGT GTTGTGTGCG TGGCAGCCTC 1920 
CAGGAATAAA CATTCTTGTT GTCCTTTGTA AAAAAAAAAA AAAAAAAA



Seq ID NO: 257 Protein sequence:
Protein Accession #: NP_057405

1 11 21 31 41 51

| | | | | |

MAWNTNLRWR LPLTCLLLQV IMVILFGVFV RYDFEADAHW WSERTHKNLS DMENEFYYRY 60
PSFQDVHVMV FVGFGFLMTF LQRYGFSAVG FNFLLAAFGI QWALLMQGWF HFLQDRYIVV 120
GVENLINADF CVASVCVAFG AVLGKVSPIQ LLIMTFFQVT LFAVNEFILL NLLKVKDAGG 180
SMTIHTFGAY FGLTVTRILY RRNLEQSKER QNSVYQSDLF AMIGTLFLWM YWPSFNSAIS 240
YHGDSQHRAA INTYCSLAAC VLTSVAISSA LHKKGKLDMV HIQNATLAGG VAVGTAAEMM 300
LMPYGALIIG FVCGIISTLG FVYLTPFLES RLHIQDTCGI NNLHGIPGII GGIVGAVTAA 360
SASLEVYGKE GLVHSFDFQG FNGDWTARTQ GKFQIYGLLV TLAMALMGGI IVGLILRLPF 420
WGQPSDENCF EDAVYWEMPE GNSTVYIPED PTFKPSGPSV PSVPMVSPLP MASSVPLVP



Seq ID NO: 258 DNA sequence 

Nucleic Acid Accession #: NM_002358.2 

Coding sequence: 75.. 692 



1 11 21 31 41 51 

| | | | | |

GGGAAGTGCT GTTGGAGCCG CTGTGGTTGC TGTCCGCGGA GTGGAAGCGC GTGCTTTTGT 60 

TTGTGTCCCT GGCCATGGCG CTGCAGCTCT CCCGGGAGCA GGGAATCACC CTGCGCGGGA 120

GCGCCGAAAT CGTGGCCGAG TTCTTCTCAT TCGGCATCAA CAGCATTTTA TATCAGCGTG 180 

GCATATATCC ATCTGAAACC TTTACTCGAG TGCAGAAATA CGGACTCACC TTGCTTGTAA 240 

CTACTGATCT TGAGCTCATA AAATACCTAA ATAATGTGGT GGAACAACTG AAAGATTGGT 300 

TATACAAGTG TTCAGTTCAG AAACTGGTTG TAGTTATCTC AAATATTGAA AGTGGTGAGG 360 

TCCTGGAAAG ATGGCAGTTT GATATTGAGT GTGACAAGAC TGCAAAAGAT GACAGTGCAC 420 

CCAGAGAAAA GTCTCAGAAA GCTATCCAGG ATGAAATCCG TTCAGTGATC AGACAGATCA 480

CAGCTACGGT GACATTTCTG CCACTGTTGG AAGTTTCTTG TTCATTTGAT CTGCTGATTT 540 

ATACAGACAA AGATTTGGTT GTACCTGAAA AATGGGAAGA GTCGGGACCA CAGTTTATTA 600

CCAATTCTGA GGAAGTCCGC CTTCGTTCAT TTACTACTAC AATCCACAAA GTAAATAGCA 660 

TGGTGGCCTA CAAAATTCCT GTCAATGACT GAGGATGACA TGAGGAAAAT AATGTAATTG 720 

TAATTTTGAA ATGTGGTTTT CCTGAAATCA GGTCATCTAT AGTTGATATG TTTTATTTCA 780 

TTGGTTAATT TTTACATGGA GAAAACCAAA ATGATACTTA CTGAACTGTG TGTAATTGTT 840

CCTTTATTTT TTTGGTACCT ATTTGACTTA CCATGGAGTT AACATCATGA ATTTATTGCA 900 

CATTGTTCAA AAGGAACCAG GAGGTTTTTT TGTCAACATT GTGATGTATA TTCCTTTGAA 960 

GATAGTAACT GTAGATGGAA AAACTTGTGC TATAAAGCTA GATGCTTTCC TAAATCAGAT 1020

GTTTTGGTCA AGTAGTTTGA CTCAGTATAG GTAGGGAGAT ATTTAAGTAT AAAATACAAC 1080 

AAAGGAAGTC TAAATATTCA GAATCTTTGT TAAGGTCCTG AAAGTAACTC ATAATCTATA 1140 

AACAATGAAA TATTGCTGTA TAGCTCCTTT TGACCTTCAT TTCATGTATA GTTTTCCCTA 1200 

TTGAATCAGT TTCCAATTAT TTGACTTTAA TTTATGTAAC TTGAACCTAT GAAGCAATGG 1260 

ATATTTGTAC TGTTTAATGT TCTGTGATAC AGAACTCTTA AAAATGTTTT TTCATGTGTT 1320

TTATAAAATC AAGTTTTAAG TGAAAGTGAG GAAATAAAGT TAAGTTTGTT TTAAAAAAAA 1380 
AAAAAAAAAA 



Seq ID NO: 259 Protein sequence:
Protein Accession #: NP_002349.1

1 11 21 31 41 51

| | | | | |

MALQLSREQG ITLRGSAEIV AEFFSFGINS ILYQRGIYPS ETFTRVQKYG LTLLVTTDLE 60
LIKYLNNVVE QLKDWLYKCS VQKLVVVISN IESGEVLERW QFDIECDKTA KDDSAPREKS 120
QKAIQDEIRS VIRQITATVT FLPLLEVSCS FDLLIYTDKD LVVPEKWEES GPQFITNSEE 180
VRLRSFTTTI HKVNSMVAYK IPVND



Seq ID NO: 260 DNA sequence 
Nucleic Acid Accession #: NM_001211 
Coding sequence: 43.. 3195 



1 11 21 31 41 51

| | | | | |

AAAGGCCTGC AGCAGGACGA GGACCTGAGC CAGGAATGCA GGATGGCGGC GGTGAAGAAG 60
GAAGGGGGTG CTCTGAGTGA AGCCATGTCC CTGGAGGGAG ATGAATGGGA ACTGAGTAAA 120
GAAAATGTAC AACCTTTAAG GCAAGGGCGG ATCATGTCCA CGCTTCAGGG AGCACTGGCA 180
CAAGAATCTG CCTGTAACAA TACTCTTCAG CAGCAGAAAC GGGCATTTGA ATATGAAATT 240
CGATTTTACA CTGGAAATGA CCCTCTGGAT GTTTGGGATA GGTATATCAG CTGGACAGAG 300
CAGAACTATC CTCAAGGTGG GAAAGAGAGT AATATGTCAA CGTTATTAGA AAGAGCTGTA 360
GAAGCACTAC AAGGAGAAAA ACGATATTAT AGTGATCCTC GATTTCTCAA TCTCTGGCTT 420
AAATTAGGGC GTTTATGCAA TGAGCCTTTG GATATGTACA GTTACTTGCA CAACCAAGGG 480
ATTGGTGTTT CACTTGCTCA GTTCTATATC TCATGGGCAG AAGAATATGA AGCTAGAGAA 540
AACTTTAGGA AAGCAGATGC GATATTTCAG GAAGGGATTC AACAGAAGGC TGAACCACTA 600
GAAAGACTAC AGTCCCAGCA CCGACAATTC CAAGCTCGAG TGTCTCGGCA AACTCTGTTG 660
GCACTTGAGA AAGAAGAAGA GGAGGAAGTT TTTGAGTCTT CTGTACCACA ACGAAGCACA 720
CTAGCTGAAC TAAAGAGCAA AGGGAAAAAG ACAGCAAGAG CTCCAATCAT CCGTGTAGGA 780
GGTGCTCTCA AGGCTCCAAG CCAGAACAGA GGACTCCAAA ATCCATTTCC TCAACAGATG 840
CAAAATAATA GTAGAATTAC TGTTTTTGAT GAAAATGCTG ATGAGGCTTC TACAGCAGAG 900
TTGTCTAAGC CTACAGTCCA GCCATGGATA GCACCCCCCA TGCCCAGGGC CAAAGAGAAT 960
GAGCTGCAAG CAGGCCCTTG GAACACAGGC AGGTCCTTGG AACACAGGCC TCGTGGCAAT 1020
ACAGCTTCAC TGATAGCTGT ACCCGCTGTG CTTCCCAGTT TCACTCCATA TGTGGAAGAG 1080
ACTGCACAAC AGCCAGTTAT GACACCATGT AAAATTGAAC CTAGTATAAA CCACATCCTA 1140
AGCACCAGAA AGCCTGGAAA GGAAGAAGGA GATCCTCTAC AAAGGGTTCA GAGCCATCAG 1200
CAAGCGTCTG AGGAGAAGAA AGAGAAGATG ATGTATTGTA AGGAGAAGAT TTATGCAGGA 1260
GTAGGGGAAT TCTCCTTTGA AGAAATTCGG GCTGAAGTTT TCCGGAAGAA ATTAAAAGAG 1320
CAAAGGGAAG CCGAGCTATT GACCAGTGCA GAGAAGAGAG CAGAAATGCA GAAACAGATT 1380
GAAGAGATGG AGAAGAAGCT AAAAGAAATC CAAACTACTC AGCAAGAAAG AACAGGTGAT 1440
CAGCAAGAAG AGACGATGCC TACAAAGGAG ACAACTAAAC TGCAAATTGC TTCCGAGTCT 1500
CAGAAAATAC CAGGAATGAC TCTATCCAGT TCTGTTTGTC AAGTAAACTG TTGTGCCAGA 1560
GAAACTTCAC TTGCGGAGAA CATTTGGCAG GAACAACCTC ATTCTAAAGG TCCCAGTGTA 1620
CCTTTCTCCA TTTTTGATGA GTTTCTTCTT TCAGAAAAGA AGAATAAAAG TCCTCCTGCA 1680
GATCCCCCAC GAGTTTTAGC TCAACGAAGA CCCCTTGCAG TTCTCAAAAC CTCAGAAAGC 1740
ATCACCTCAA ATGAAGATGT GTCTCCAGAT GTTTGTGATG AATTTACAGG AATTGAACCC 1800
TTGAGCGAGG ATGCCATTAT CACAGGCTTC AGAAATGTAA CAATTTGTCC TAACCCAGAA 1860
GACACTTGTG ACTTTGCCAG AGCAGCTCGT TTTGTATCCA CTCCTTTTCA TGAGATAATG 1920
TCCTTGAAGG ATCTCCCTTC TGATCCTGAG AGACTGTTAC CGGAAGAAGA TCTAGATGTA 1980
AAGACCTCTG AGGACCAGCA GACAGCTTGT GGCACTATCT ACAGTCAGAC TCTCAGCATC 2040
AAGAAGCTGA GCCCAATTAT TGAAGACAGT CGTGAAGCCA CACACTCCTC TGGCTTCTCT 2100
GGTTCTTCTG CCTCGGTTGC AAGCACCTCC TCCATCAAAT GTCTTCAAAT TCCTGAGAAA 2160
CTAGAACTTA CTAATGAGAC TTCAGAAAAC CCTACTCAGT CACCATGGTG TTCACAGTAT 2220
CGCAGACAGC TACTGAAGTC CCTACCAGAG TTAAGTGCCT CTGCAGAGTT GTGTATAGAA 2280
GACAGACCAA TGCCTAAGTT GGAAATTGAG AAGGAAATTG AATTAGGTAA TGAGGATTAC 2340
TGCATTAAAC GAGAATACCT AATATGTGAA GATTACAAGT TATTCTGGGT GGCGCCAAGA 2400
AACTCTGCAG AATTAACAGT AATAAAGGTA TCTTCTCAAC CTGTCCCATG GGACTTTTAT 2460
ATCAACCTCA AGTTAAAGGA ACGTTTAAAT GAAGATTTTG ATCATTTTTG CAGCTGTTAT 2520
CAATATCAAG ATGGCTGTAT TGTTTGGCAC CAATATATAA ACTGCTTCAC CCTTCAGGAT 2580
CTTCTCCAAC ACAGTGAATA TATTACCCAT GAAATAACAG TGTTGATTAT TTATAACCTT 2640
TTGACAATAG TGGAGATGCT ACACAAAGCA GAAATAGTCC ATGGTGACTT GAGTCCAAGG 2700
TGTCTGATTC TCAGAAACAG AATCCACGAT CCCTATGATT GTAACAAGAA CAATCAAGCT 2760
TTGAAGATAG TGGACTTTTC CTACAGTGTT GACCTTAGGG TGCAGCTGGA TGTTTTTACC 2820
CTCAGCGGCT TTCGGACTGT ACAGATCCTG GAAGGACAAA AGATCCTGGC TAACTGTTCT 2880
TCTCCCTACC AGGTAGACCT GTTTGGTATA GCAGATTTAG CACATTTACT ATTGTTCAAG 2940
GAACACCTAC AGGTCTTCTG GGATGGGTCC TTCTGGAAAC TTAGCCAAAA TATTTCTGAG 3000
CTAAAAGATG GTGAATTGTG GAATAAATTC TTTGTGCGGA TTCTGAATGC CAATGATGAG 3060
GCCACAGTGT CTGTTCTTGG GGAGCTTGCA GCAGAAATGA ATGGGGTTTT TGACACTACA 3120
TTCCAAAGTC ACCTGAACAA AGCCTTATGG AAGGTAGGGA AGTTAACTAG TCCTGGGGCT 3180
TTGCTCTTTC AGTGAGCTAG GCAATCAAGT CTCACAGATT GCTGCCTCAG AGCAATGGTT 3240
GTATTGTGGA ACACTGAAAC TGTATGTGCT GTAATTTAAT TTAGGACACA TTTAGATGCA 3300
CTACCATTGC TGTTCTACTT TTTGGTACAG GTATATTTTG ACGTCACTGA TATTTTTTAT 3360
ACAGTGATAT ACTTACTCAT GGCCTTGTCT AACTTTTGTG AAGAACTATT TTATTCTAAA 3420
CAGACTCATT ACAAATGGTT ACCTTGTTAT TTAACCCATT TGTCTGTACT TTTCCCTGTA 3480
CTTTTCCCAT TTGTAATTTG TAAAATGTTC TCTTATGATC ACCATGTATT TTGTAAATAA 3540
TAAAATAGTA TCTGTTAAAA AAAAAAAAAA AAAAAAAAAA AAA



Seq ID NO: 261 Protein sequence:
Protein Accession #: NP_001202

1 11 21 31 41 51

| | | | | |

MAAVKKEGGA LSEAMSLEGD EWELSKENVQ PLRQGRIMST LQGALAQESA CNNTLQQQKR 60
AFEYEIRFYT GNDPLDVWDR YISWTEQNYP QGGKESNMST LLERAVEALQ GEKRYYSDPR 120
FLNLWLKLGR LCNEPLDMYS YLHNQGIGVS LAQFYISWAE EYEARENFRK ADAIFQEGIQ 180
QKAEPLERLQ SQHRQFQARV SRQTLLALEK EEEEEVFESS VPQRSTLAEL KSKGKKTARA 240
PIIRVGGALK APSQNRGLQN PFPQQMQNNS RITVFDENAD EASTAELSKP TVQPWIAPPM 300
PRAKENELQA GPWNTGRSLE HRPRGNTASL IAVPAVLPSF TPYVEETAQQ PVMTPCKIEP 360
SINHILSTRK PGKEEGDPLQ RVQSHQQASE EKKEKMMYCK EKIYAGVGEF SFEEIRAEVF 420
RKKLKEQREA ELLTSAEKRA EMQKQIEEME KKLKEIQTTQ QERTGDQQEE TMPTKETTKL 480
QIASESQKIP GMTLSSSVCQ VNCCARETSL AENIWQEQPH SKGPSVPFSI FDEFLLSEKK 540
NKSPPADPPR VLAQRRPLAV LKTSESITSN EDVSPDVCDE FTGIEPLSED AIITGFRNVT 600
ICPNPEDTCD FARAARFVST PFHEIMSLKD LPSDPERLLP EEDLDVKTSE DQQTACGTIY 660
SQTLSIKKLS PIIEDSREAT HSSGFSGSSA SVASTSSIKC LQIPEKLELT NETSENPTQS 720
PWCSQYRRQL LKSLPELSAS AELCIEDRPM PKLEIEKEIE LGNEDYCIKR EYLICEDYKL 780
FWVAPRNSAE LTVIKVSSQP VPWDFYINLK LKERLNEDFD HFCSCYQYQD GCIVWHQYIN 840
CFTLQDLLQH SEYITHEITV LIIYNLLTIV EMLHKAEIVH GDLSPRCLIL RNRIHDPYDC 900
NKNNQALKIV DFSYSVDLRV QLDVFTLSGF RTVQILEGQK ILANCSSPYQ VDLFGIADLA 960
HLLLFKEHLQ VFWDGSFWKL SQNISELKDG ELWNKFFVRI LNANDEATVS VLGELAAEMN 1020
GVFDTTFQSH LNKALWKVGK LTSPGALLFQ



Seq ID NO: 262 DNA sequence 
Nucleic Acid Accession #: NM_003784 
Coding sequence: 365..1507

1 11 21 31 41 51

| | | | | |

GTCTACTTAT CAATAAGCAG CTGCCTGTGC AGAGTGCAGG CTGCACCTTT GGACAGCCTT 60

TAAAACTGAA TTCTCAGAAT TTTAGAACAA ATTTTTGTCT AGTiAATGCTG ACTTTGGTTC 120 

ATTAGGTAGT GGTAAAACAG GCTCCCTTCG AAGCTCTCCT TCATCACCTT CCTAAGTGCA 180 

TGTACAGGGA AGCTCTCCTT CATCACCTTC CTAAGTGCAT GGGGGAAAAT ACCTAGGGCT 240 

CAACAGTCTT GAGAAGTGTG GAAACATTTT CTTTGTGAGT GAGAACAGAT CACCTAGAGA 300 

AAGGAAACCA GATTCCCATC ACTGCTTCTG GGTATCAGAT GCTAGCGCTG CACTCCATTT 360 

TGCAATGGCC TCCCTTGCTG CAGCAAATGC AGAGTTTTGC TTCAACCTGT TCAGAGAGAT 420 

GGATGACAAT CAAGGAAATG GAAATGTGTT CTTTTCCTCT CTGAGCCTCT TCGCTGCCCT 480 

GGCCCTGGTC CGCTTGGGCG CTCAAGATGA CTCCCTCTCT CAGATTGATA AGTTGCTTCA 540 

TGTTAACACT GCCTCAGGAT ATGGAAACTC TTCTAATAGT CAGTCAGGGC TCCAGTCTCA 600 

ACTGAAAAGA GTTTTTTCTG ATATAAATGC ATCCCACAAG GATTATGATC TCAGCATTGT 660 

GAATGGGCTT TTTGCTGAAA AAGTGTATGG CTTTCATAAG GACTACATTG AGTGTGCCGA 720 

AAAATTATAC GATGCCAAAG TGGAGCGAGT TGACTTTACG AATCATTTAG AAGACACTAG 780 

ACGTAATATT AATAAGTGGG TTGAAAATGA AACACATGGC AAAATCAAGA ACGTGATTGG 840 

TGAAGGTGGC ATAAGCTCAT CTGCTGTAAT GGTGCTGGTG AATGCTGTGT ACTTCAAAGG 900 

CAAGTGGCAA TCAGCCTTCA CCAAGAGCGA AACCATAAAT TGCCATTTCA AATCTCCCAA 960 

GTGCTCTGGG AAGGCAGTCG CCATGATGCA TCAGGAACGG AAGTTCAATT TGTCTGTTAT 1020 

TGAGGACCCA TCAATGAAGA TTCTTGAGCT CAGATACAAT GGTGGCATAA ACATGTACGT 1080 

TCTGCTGCCT GAGAATGACC TCTCTGAAAT TGAAAACAAA CTGACCTTTC AGAATCTAAT 1140 

GGAATGGACC AATCCAAGGC GAATGACCTC TAAGTATGTT GAGGTATTTT TTCCTCAGTT 1200

CAAGATAGAG AAGAATTATG AAATGAAACA ATATTTGAGA GCCCTAGGGC TGAAAGATAT 1260

CTTTGATGAA TCCAAAGCAG ATCTCTCTGG GATTGCTTCG GGGGGTCGTC TGTATATATC 1320 

AAGGATGATG CACAAATCTT ACATAGAGGT CACTGAGGAG GGCACCGAGG CTACTGCTGC 1380

CACAGGAAGT AATATTGTAG AAAAGCAACT CCCTCAGTCC ACGCTGTTTA GAGCTGACCA 1440 

CCCATTCCTA TTTGTTATCA GGAAGGATGA CATCATCTTA TTCAGTGGCA AAGTTTCTTG 1500 

CCCTTG7\A7A TCCAATTGGT TTCTGTTATA GCAGTCCCCA CAACATCAAA GRACCACCAC 1560 

AAGTCAATAG ATYTGRGTTT AATTGGAAAA ATGTGGTGTT TCCTTTGAGT TTATTTCTTC 1620 

CTAACATTGG TCAGCAGATG ACACTGGTGA CTTGACCCTT CCTAGACACC TGGTTGATTG 1680 

TCCTGATCCC TGCTCTTAGC ATTCTACCAC CATGTGTCTC ACCCATTTCT AATTTCATTG 1740 

TCTTTCTTCC CACGCTCATT TCTATCATTC TCCCCCATGA CCCGTCTGGA AATTATGGAG 1800 

RGTGCTCAAC TGGTAAGGAG AACGTAGAAG TAGCCCTAGG GATCCTTTTT GAAACTCTAC 1860 

AGTTATCGCA GATATTCTAG CTTCATTGTA AGCAATCTAG GAAATAAGCC CTGCTGCTTT 1920

CTAGAAATAA GTGTGAAGGA TAAATTTTCT TTGTTGACCT ATGAAGATTT TAGAGTTTAC 1980 

CTTCATATGT TTGATTTTAA ATCAGTGTAT AATCTAGATG GTAAAAAATG TGAAATTGGG 2040 

ATTAGGGACC TACCAAAATA TTTCATTAAT GCTTTCAATT GACAAATTTT GGCCTTTCTT 2100

TGATAAGACA ATATGTACAT GTTTTTTCAA ATATTAAAGA TCTTTT7\ACT GTTGGCAGTT 2160 

GTTATCTACA GAATCATATT TCATATGCTG TGTAGTTTAT AAGTTTTTCC TCTATTTATC 2220 
AGAATAAAGA AATACAACAT ACCTGTAAA 

Seq ID NO: 263 Protein sequence:
Protein Accession #: NP_003775

1 11 21 31 41 51

| | | | | |

MASLAAANAE FCFNLFREMD DNQGNGNVFF SSLSLFAALA LVRLGAQDDS LSQIDKLLHV 60
NTASGYGNSS NSQSGLQSQL KRVFSDINAS HKDYDLSIVN GLFAEKVYGF HKDYIECAEK 120
LYDAKVERVD FTNHLEDTRR NINKWVENET HGKIKNVIGE GGISSSAVMV LVNAVYFKGK 180
WQSAFTKSET INCHFKSPKC SGKAVAMMHQ ERKFNLSVIE DPSMKILELR YNGGINMYVL 240
LPENDLSEIE NKLTFQNLME WTNPRRMTSK YVEVFFPQFK IEKNYEMKQY LRALGLKDIF 300
DESKADLSGI ASGGRLYISR MMHKSYIEVT EEGTEATAAT GSNIVEKQLP QSTLFRADHP 360
FLFVIRKDDI ILFSGKVSCP

Seq ID NO: 264 DNA sequence
Nucleic Acid Accession #: AB052906
Coding sequence: 74-814

1 11 21 31 41 51

| | | | | |

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60
CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120
GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180
CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240
GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300
CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360
GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420
GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480
TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540
AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600
GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660
CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720
CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780
CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840
AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900
CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960
TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020
TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080
TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140
GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200
TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260
ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA A



Seq ID NO: 265 Protein sequence: 
Protein Accession #: BAB61048.1 



1 11 21 31 41 51

| | | | | |

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKPRPGPRWC AVQGQVDEKT
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKVVAM
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC
FILPGI

Seq ID NO: 266 DNA sequence
Nucleic Acid Accession #: XM_084853.1
Coding sequence: 127-444

1 11 21 31 41 51

| | | | | |

ATTGATGATA TATTTAACGA AATCAAATTT GGTGAATATG TGGACACTGG AAAGCTT^TC
GACAAGATCA ACTTACCAGA TTTCCTAT^ GTGTACCTTA ACC7VCAAGCC ACCTTTTGGT
AACACCATGA GTGGCATCCA CAAGAGCTTT GAGGTGCTCG GTTATACCAA CTCCAAAGGG
AAAAAGGCCA TTCGAAGAGA GGACTTCCTG AGACTGCTCG TTACTAAAGG TGAGCATATG
ACGGAGGAGG AGATGTTGGA TTGCTTTGCT TCACTGTTTG GCCTGAATCC CGAGGGATGG
AAATCCGAGC CTGCAACCTG CTCCGTCAAA GGTTCAGAAA TTTGCCTTGA AGAAGAACTT
CCAGACGAAA TCACTGCAGA AATATTCGCG ACTGAAATTC TTGGCTTAAC CATTTCAGAA
GATTCCGGCC AGGATGGTCA GTGAAGTTAC CAGGAATGTT TAAAGCACAA AGGACTTTGG
GTGTGTGTGC ATGCACATGT GTGTGTTTTC CATGAGGCAC TGCTTTTTAT GCATTTCCCT
CCCCCCTCTC ATCTTTAGAA CATTTAGACA TTAAAGCAAG TTTCTGGTGA GCAATG



Seq ID NO: 267 Protein sequence:
Protein Accession #: XP_084853.1

1 11 21 31 41 51

| | | | | |

MSGIHKSFEV LGYTNSKGKK AIRREDFLRL LVTKGEHMTE EEMLDCFASL FGLNPEGWKS
EPATCSVKGS EICLEEELPD EITAEIFATE ILGLTISEDS GQDGQ

Seq ID NO: 268 DNA sequence 
Nucleic Acid Accession #: NM_001898 
Coding sequence: 57-482 

1 11 21 31 41 51 

| | | | | |

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 
GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 
GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 
AAACAGTAGC ATCGCC 

Seq ID NO: 269 Protein sequence:
Protein Accession #: NP_001889.1

1 11 21 31 41 51

| | | | | |

MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF
EIYEVPWENR RSLVKSRCQE S

Seq ID NO: 270 DNA sequence
Nucleic Acid Accession #: XM_093210
Coding sequence: 13-1854

1 11 21 31 41 51

| | | | | |

ATGGCAAGCG CCGGAATCTC CTCAGCTGCC GTTTCACAAA AGAGGTACCA GGTCCGCACC
AAACGAGCAC ACAAGCAGCA CCAGGAGCTG CAGAAGAAGG AGGCGGCAGC GATGGACCAG
GGCAGAGGGA ATGGGGAGGG GGCATCCTAC CCCATATCTG AGGTGCGACT GCGGGACGTA
GAGCGGACTG GGCCTTTCCC GTTGGCGCGT GGCCTCAATC AGGACTTCTT GCCCACGTGC
GCCTTCAAAA CGGTAAGAGC TGCAACTGAA CGTGTGAGAC ATGGTGCAGA TAGGCTGAGA
GGCGGCGGGA GAGATGCCCA TGAACTCAAG TACCCGGACA CGCCCTCCAC TTCTACCACC
ACGAGTAACA CCGCCCCCAC GGGACCGCTC TCGAGGTCCC CCAAGCCAAG GACGCAAGGA
GGAACGCCCC GGCGCGCGGC CAGCAGCGGC GGGCACCGGC CCAATGGCCA CGGAACTCAG
CACTGGCAGT CGGCCCTCCT CACACCGCAG GCGTGCAGTG TGGCCGACGG AGCCTCCCGG
GCCGAGGACC CAGCTAGGCC GTCACCCCGG TTGCTCCCAC GGGAAGGGGC ACCAGGCAAA
CTGCCCAAGG CCCCGAGCCC AGGCTCCCTG GCGGAGGCCT CCGCTGGTCC CGCCCAGATC
ATGGCCGCCA CCAGGCTCCC GAGCCATGGC TTCCTGTCCG GGAACGGCCC GGCGTCCTGG
CTGTCCAGCT AG

Seq ID NO: 271 Protein sequence:
Protein Accession #: XP_093210

1 11 21 31 41 51

| | | | | |

MLRHGEQKRK RARKKWDFLP TCAFKTVRAA TERVRHGADR LRGGGRDAHE LKYPDTPSTS 60
TTTSNTAPTG PLSRSPKPRT QGGTPRRRPA AAGTRANGHG TQHWQSALLT PQACSVADGA 120
SRAEDPARPS PRLLPREGAP GKLPKAPSPG SLAEASAGLL AHVRLQNADA QRVSISQALP 180
PNSSVGRKEE RPGAGQQRRA PAPMATELST GSRPSSHRRR AVWPTEPPGP RTQLEPSPRL 240
LPREGAPGKL PKAPSPGSLA EASAGPAQIM AATRLPSRGF LSGNGPASWL SS

Seq ID NO: 272 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..732

1 11 21 31 41 51

| | | | | |

GGATACTGTG TCACTCAAAG TAATGGGAGG GAGAGAGAAC AGGGAGGGTA GGGATGCTTT 60 
TGAAAAAGCT TTTTTTCCCA CTTTTAACTT GCTTTAGCGT TAAGAGTACT TACCAGCTAA 120

TAATGTGGAG GAAATTATTC TTTCTCATTG GAGATTACAG AATATATCTA TTCATCTTGA 180 

ATACCCACTT GAAGCCTCTG TAGAAATGTC TCGTCCTCCG GTTGTATTTC TAA/y^CCTAC 240 

ATGATTTTGT CTTGTTTCTG CAGTGAGAAA TTACATCCAT AGCAAAGACA AAAGTCTTTT 300 

TAAATTATTT TTATTTATCT TTCATATAGT TCTTACZ^TT TCTAAAAAAT TAACACTCAT 360 
TTAGTATCAC AATTTATGGG AGAGGGTTTT TTGTATTTTT AAGCATATGT GGCTTATATA 420

AT^TTGCAG AAGTCATAGG ACTGTCATGT ATTGCAGCTC TGAGAACCAA TGCCTGAAAC 480 

TTAAGCC 

Seq ID NO: 273 Protein sequence:
Protein Accession #: Eos sequence 






1 11 21 31 41 51 

| | | | | |

MGGRENREGR DAFEKAFFPT FNLL 

Seq ID NO: 274 DNA sequence 

Nucleic Acid Accession #: NM_003976.2

Coding sequence: 299-961 



1 11 21 31 41 51 

| | | | | |

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGGC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420

GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480

CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

GCCGGCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 

CCCTCCTCTG ATGAACAGTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320 

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380
CCTGTACTCA CTCATGGGAG CTGGCCCC 



Seq ID NO: 275 Protein sequence:
Protein Accession #: NP_003967.1

1 11 21 31 41 51

| | | | | |

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 276 DNA sequence
Nucleic Acid Accession #: NM_057091.1
Coding sequence: 783-1445

1 11 21 31 41 51

| | | | | |

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60

GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120

TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180 

CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240 

CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300 

CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360 

CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420

CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480 

CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540 



CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660

GCCCAGCACT GGTCCCCGGA 7VAGGTGCCTA GAAGT^CAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 277 Protein sequence:
Protein Accession #: NP_003967.1

1 11 21 31 41 51

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 278 DNA sequence 

Nucleic Acid Accession #: NM_057160.1 

Coding sequence: 1-714 

1 11 21 31 41 51 

I I I I I I 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 

Seq ID NO: 279 Protein sequence:
Protein Accession #: NP_476501.1

1 11 21 31 41 51

I I I I I I

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60 

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 

RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 

Seq ID NO: 280 DNA sequence

Nucleic Acid Accession #: NM_057090.1

Coding sequence: 29-715 

1 11 21 31 41 51 

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 
CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720
CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780
GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840
CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900
CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960
CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020
TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080
TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140
CTCACTCATG GGAGCTGGCC CC

Seq ID NO: 281 Protein sequence:
Protein Accession #: NP_476431.1

1 11 21 31 41 51

I I I I I I

                                            LSSVAEASLG SAPRSPAPRE 60
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120
                                            SCRRARSPHD LSLASLLGAG 180
                                            ATACGCLG

Seq ID NO: 282 DNA sequence

Nucleic Acid Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

CTACTGCACC TGCCCTCTGT TTCCTTTGGA AATCTCTTAC CTTTCATTAG GGTTTCTTTC 60
ATAGCAATTT CCTTTGGTTT TTAAGACTTC TACATTGCTT TTTCTTTTAT TATCTGTGCT 120
GCGTGAACCT TATGAATGCT GCTTAAAAAT AATGTCAAAA TATGTTTTAG CTGCCTACTC 180
AGGTAACGTT TTCTTTTGCT CTCATCTTGG TTTCCATATA CTATTTTTGG            240
GATCTAATCA ATGATCTAGT CAGAAGCTAC TTCACTGGCT AACAGTGATC ATGTTCATGT 300
GCTAAATW^TG AACTTGAAAC ACGGAAGTAG TGGTTGGTCC AGTTTGAAAG CTCTTATTAG 360
TATTCTTCAT CCTGGCTGTA ATAATAGCCA TTATTTGTTA TGCCTTTGTT ATGTAGCAGA 420
CACTCTTAAG GATTTTATGT GTATTATTCA AATTGCTATT ACTGTTCTTT TTATAGTTGA 480
GAATCTCAGG ATACCTACAT TTATCACTTT TTCAATATAT ATGTATTTCT TATT
Seq ID NO: 283 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 564-1481

1 11 21 31 41 51

I I I I I I

GAGACTTTTA ATCATCTATC CCTTGTGCTT TACGCAGACC CTACAATACA CTAGAGGCTT 60
CAAAGAGGTC AAAAATTCAC ATGTGTAGAC AAATTAGGTC CCTTAAGATG CCAGGCAAAC 120
GAAGTGCTAC CAAAACACGC AATGACTGTC CTAAAAGTGC GTTCTGGGAT ACACCTGTAA 180
ACTTGGATCA AGTTCCCTCC CCTCTCCTCA AAATATATCG ACTTGTGCTG AAAGAAATCA 240
CGACCGATGC TCACAATTCT GACCTCGTAA TTATATAGGG GGTGGTTTTG GTTTCTGCGT 300
CTTTCCCTGA TTCAGTGGCA GGTAACATAT TTCATGTACA AAATGAACTG CAACACCACG 360
GCAAACAAGG GACAGGCCCT CAAAGTTGTC GGTAGGGAGC CAGGACCCCG CCAGTGGCGT 420
GGGGAGACAC CGTACTAAAC AAGCTTGCAA ACAGCAGGCA CCTTCCTGCC ACTGAGGAGG 480
AAGGGCTGGC TAAGGGAGGC CGGGGCGGAG GAAGCCAAGC TCTGCAGGCC CTGACAAAGT 540
CCTCCCGGCC TCCACGCGTC GCCATGGCAA CGCGGGGTCT GTGCTGGCCG GGATTGGCCG 600
GCCTGGCGCG CGCAGGGCCC GCTGGGAAAG CGCGTCCCCG CCGCGGCTCC GCCAGTTTGA 660
ACTTGGCGGG CCAGATGTGG GCGGCGGGGC GCTGGGGGCC TACTTTTCCC TCTTCCTACG 720
CCGGTTTCTC TGCTGACTGC AGACCCAGGT CTCGGCCCTC CTCGGACTCC TGCTCAGTCC 780
CTATGACGGG CGCACGTGGG CAGGGGCTGG AGGTGGTGCG CTCGCCGTCG CCGCCGCTGC 840
CGCTGAGCTG CAGCAATTCC ACCAGGTCGC TGTTGTCTCC CCTTGGCCAC CAGAGCTTCC 900
AGTTTGACGA GGACGACGGT GACGGGGAGG ATGAGGAAGA CGTGGATGAT GAGGAAGACG 960
TGGATGAAGA TGCCCATGAT TCAGAGGCCA AAGTGGCGAG CCTGAGAGGA ATGGAGTTAC 1020
AGGGGTGCGC CAGCACTCAG GTTGAATCAG AAAATAACCA AGAAGAACAG AAACAGGTGC 1080
GCTTACCAGA AAGCCGCCTG ACACCATGGG AGGTGTGGTT TATTGGCAAA GAAAAAGAAG 1140
AACGTGACCG GCTGCAACTG AAAGCTCTAG AGGAATTAAA TCAACAACTA GAAAAAAGAA 1200
AAGAAATGGA AGAACGTGAA AAAAGAAAGA TAATTGCTGA AGAAAAGCAC AAGGAATGGG 1260
TTCAGAAAAA GAATGAGCAA AAAAGAAAAG AAAGAGAACA AAAAATTAAT AAAGAAATGG 1320
AGGAAAAAGC AGCAAAGGAA CTGGAGAAAG AATACTTGCA AGAAAT^GCA AAAGAAAAAT 1380
ATCAAGAATG GTTAAAGAAA AAAAATGCTG AAGTU^TGTGA GAGGAAGAAG AAAGAAAAGA 1440
AAAACAACAG CAAGCTGAAA TACAGGAGAA AAAGGAAATA GCT^GAAAAAA AGTTTCAAGA 1500
ATGGTTGGAA AATGCGAAAC ATAAACCTCG TCCAGCTGCA AAGAGCTATG GTTATGCCAA 1560
TGGAAAACTT ACAGGTTTTT ACAGTGGAAA TTCCTATCCA GAACCAGCCT TTTATAATCC 1620
AATTCCGTGG AAACCAATTC ATATGCCACC TCCCAAAGAA GCTAAGGATC TATCAGGAAG 1680
GAAGAGTAAA AGACCTGTGA TAAGTCAGCC ACACAAGTCA TCATCTCTGG TAATTCATAA 1740
AGCCAGGAGC AATCTTTGCC TTGGAACTCT GTGCAGAATA CAAAGATAGC GTATGTGGAA 1800
AATAACATGC TTTTATCTGG AGCTATTTAA TTTAAAAATC AGAAATTGTT TTTTACTGCT 1860
CAGTCAATAA CTCAACACTT AATGTGATTA TTGACAAATA GCAATTTTTG CATTTGTATA 1920
TGGAGTCCTT AGAGTTGAGG AAGATATTTT CTGGATTTTG GTTTTTATAA ACTTTTTAAG 1980
GTTGATCTTG GCATGTTGTT TTGCAGAATA AGTGGCTGAA TATGTAAGAA TTGTGTTTGT 2040
ATTTAGCTTG TATTAAAAGT ACACTGTAAT ACCAATAAAA CTAACAATTT TTCTTG

Seq ID NO: 284 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

MATRGLCWPG LAGLARAGPA GKARPRRGSA SLNLAGQMWA AGRWGPTFPS SYAGFSADCR
PRSRPSSDSC SVPMTGARGQ GLEVVRSPSP PLPLSCSNST RSLLSPLGHQ SFQFDEDDGD
GEDEEDVDDE EDVDEDAHDS EAKVASLRGM ELQGCASTQV ESENNQEEQK QVRLPESRLT
PWEVWFIGKE KEERDRLQLK ALEELNQQLE KRKEMEEREK RKIIAEEKHK EWVQKKNEQK
RKEREQKINK EMEEKAAKEL EKEYLQEKAK EKYQEWLKKK NAEECERKKK EKKNNSKLKY
Seq ID NO: 285 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1-1746

1 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360

TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TGCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260

TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560

ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 

TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 

CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 

AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400

AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 

AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940

TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060

GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420

TCCAGGGTTA TTCTGCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660

TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGG ACAAAAAGTG TCGAGAGGGC 3900

TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960

AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 

GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320

GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCTGTGTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 
GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800
TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980
TTGTCTTGGG CTTTCGTCAT TTU^CCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040
TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100
GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160
TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220
AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280
GAACTTCCM ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340
AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400
GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580
CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640
GCCCCAGTGC TTGGCGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760
GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT

Seq ID NO: 286 Protein sequence:
Protein Accession #: NP_570843.1

1 11 21 31 41 51

I I I I I I

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60
LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120
FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180
GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240
FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300
YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360
FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420
YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480
VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C

Seq ID NO: 287 DNA sequence
Nucleic Acid Accession #: NM_002362
Coding sequence: 1..954

1 11 21 31 41 51

I I I I I I

ATGTCTTCTG AGCAGAAGAG TCAGCACTGC AAGCCTGAGG AAGGCGTTGA GGCCCAAGAA 60
GAGGCCCTGG GCCTGGTGGG TGCACAGGCT CCTACTACTG AGGAGCAGGA GGCTGCTGTC 120
TCCTCCTCCT CTCCTCTGGT CCCTGGCACC CTGGAGGAAG TGCCTGCTGC TGAGTCAGCA 180
GGTCCTCCCC AGAGTCCTCA GGGAGCCTCT GCCTTACCCA CTACCATCAG CTTCACTTGC 240
TGGAGGCAAC CCAATGAGGG TTCCAGCAGC CAAGAAGAGG AGGGGCCAAG CACCTCGCCT 300
GACGCAGAGT CCTTGTTCCG AGAAGCACTC AGTAACAAGG TGGATGAGTT GGCTCATTTT 360
CTGCTCCGCA AGTATCGAGC CAAGGAGCTG GTCACAAAGG CAGAAATGCT GGAGAGAGTC 420
ATCAAAAATT ACAAGCGCTG CTTTCCTGTG ATCTTCGGCA AAGCCTCCGA GTCCCTGAAG 480
ATGATCTTTG GCATTGACGT GAAGGAAGTG GACCCCGCCA GCAACACCTA CACCCTTGTC 540
ACCTGCCTGG GCCTTTCCTA TGATGGCCTG CTGGGTAATA ATCAGATCTT TCCCAAGACA 600
GGCCTTCTGA TAATCGTCCT GGGCACAATT GCAATGGAGG GCGACAGCGC CTCTGAGGAG 660
GAAATCTGGG AGGAGCTGGG TGTGATGGGG GTGTATGATG GGAGGGAGCA CACTGTCTAT 720
GGGGAGCCCA GGAAACTGCT CACCCAAGAT TGGGTGCAGG AAAACTACCT GGAGTACCGG 780
CAGGTACCCG GCAGTAATCC TGCGCGCTAT GAGTTCCTGT GGGGTCCAAG GGCTCTGGCT 840
GAAACCAGCT ATGTGAAAGT CCTGGAGCAT GTGGTCAGGG TCAATGCAAG AGTTCGCATT 900
GCCTACCCAT CCCTGCGTGA AGCAGCTTTG TTAGAGGAGG AAGAGGGAGT CTGA

Seq ID NO: 288 Protein sequence:
Protein Accession #: NP_002353.1

1 11 21 31 41 51

I I I I I I

MSSEQKSQHC KPEEGVEAQE EALGLVGAQA PTTEEQEAAV SSSSPLVPGT LEEVPAAESA 60
GPPQSPQGAS ALPTTISFTC WRQPNEGSSS QEEEGPSTSP DAESLFREAL SNKVDELAHF 120
LLRKYRAKEL VTKAEMLERV IKNYKRCFPV IFGKASESLK MIFGIDVKEV DPASNTYTLV 180
TCLGLSYDGL LGNNQIFPKT GLLIIVLGTI AMEGDSASEE EIWEELGVMG VYDGREHTVY 240
GEPRKLLTQD WVQENYLEYR QVPGSNPARY EFLWGPRALA ETSYVKVLEH VVRVNARVRI 300
AYPSLREAAL LEEEEGV

Seq ID NO: 289 DNA sequence
Nucleic Acid Accession #: NM_002362
Coding sequence: 46..1344

1 11 21 31 41 51

I I I I I I

CGGCGGCCGC GCCCTGGTTG GGTCCCCACT GCTCTCGGGG GCGCCATGGA CGAGGCCGTG 60
GGCGACCTGA AGCAGGCGCT TCCCTGTGTG GCCGAGTCGC CAACGGTCCA CGTGGAGGTG 120
CATCAGCGCG GCAGCAGCAC TGCAAAGAAA GAAGACATAA ACCTGAGTGT TAGAAAGCTA 180
CTCAACAGAC ATAATATTGT GTTTGGTGAT TACACATGGA CTGAGTTTGA TGAACCTTTT 240
TTGACCAGAA ATGTGCAGTC TGTGTCTATT ATTGACACAG AATTAAAGGT TAAAGACTCA 300
CAGCCCATCG ATTTGAGTGC ATGCACTGTT GCACTTCACA TTTTCCAGCT GAATGAAGAT 360
GGCCCCAGCA GTGAAAATCT GGAGGAAGAG ACAGAAAACA TAATTGCAGC AAATCACTGG 420
GTTCTACCTG CAGCTGAATT CCATGGGCTT TGGGACAGCT TGGTATACGA TGTGGAAGTC 480
AAATCCCATC TCCTCGATTA TGTGATGACA ACTTTACTGT TTTCAGACAA GAACGTCAAC 540

AGCAACCTCA TCACCTGGAA CCGGGTGGTG CTGCTCCACG GTCCTCCTGG CACTGGAAAA 600 

ACATCCCTGT GTAAAGCGTT AGCCCAGAAA TTGACAATTA GACTTTCAAG CAGGTACCGA 660 

TATGGCCAAT TAATTGAAAT AAACAGCCAC AGCCTCTTTT CTAAGTGGTT TTCGGAAAGT 720 

GGCAAGCTGG TAACCAAGAT GTTTCAGAAG ATTCAGGATT TGATTGATGA TAAAGACGCC 780 

CTGGTGTTCG TGCTGATTGA TGAGGTGGAG AGTCTCACAG CCGCCCGAAA TGCCTGCAGG 840 

GCGGGCACCG AGCCATCAGA TGCCATCCGC GTGGTCAATG CTGTCTTGAC CCAAATTGAT 900 

CAGATTAAAA GGCATTCCAA TGTTGTGATT CTGACCACTT CTAACATCAC CGAGAAGATC 960 

GACGTGGCCT TCGTGGACAG GGCTGACATC AAGCAGTACA TTGGGCCACC CTCTGCAGCA 1020 

GCCATCTTCA AAATCTACCT CTCTTGTTTG GAAGAACTGA TGAAGTGTCA GATCATATAC 1080

CCTCGCCAGC AGCTGCTGAC CCTCCGAGAG CTAGAGATGA TTGGCTTCAT TGAT^CAAC 1140

GTGTCAAAAT TGAGCCTTCT TTTGAATGAC ATTTCAAGGA AGAGCGAGGG CCTCAGCGGC 1200 

CGGGTCCTGA G7yUU\.CTCCC CTTTCTGGCT CATGCGCTGT ATGTCCAGGC CCCCACCGTC 1260 

ACCATAGAGG GGTTCCTCCA GGCCCTGTCT CTGGCAGTGG ACAAGCAGTT TGAAGAGAGA 1320 

AAGAAGCTTG CAGCTTACAT CTGATCCTGG GCTTCCCCAT CTGGTGCTTT TCCCATGGAG 1380

AACACACAAC CAGTAAGTGA GGTTGCCCCA CACAGCCGTC TCCCAGGGAA TCCCTTCTGC 1440 

AAACCAAACG TTACTTAGAC TGCAAGCTAG AAAGCCACCA AGGCCAGGCT TTGTTAAAAG 1500 

AAGTGTATTC TATTTATGTT GTTTTAAAAT GCATACTGAG AGACAAACAT CTTGTCATTT 1560 

TCACTGTTTG TAAAAGATAA TTCAGATTGT TTGTCTCCTT GTGAAGAACC ATCGAAACCT 1620 

GTTTGTTCCC AGCCCACCCC CAGTGGATGG GATGCATAAT GCCAGCAAGT TTTGTTTAAC 1680 

AGCAAAAAAG GAAGATTAAT GCAGGTGTTA TAGAAGCCAG AAGAGAAACT GTGTCACCCT 1740 

AAAGAAGCAT ATAATCATAG CATTAA/^T GCACACATTA CTCCAGGTGG AAGGTGGCAA 1800 

TTGCTTTCTG ATATCAGCTC GTTTGATTTA GTGCAAAAAT GTTTTCAAGA CTATTTAATG 1860

GATGTAAAAA AGCCTATTTC TACATTATAC CAACTGAGAA AAAAATGGTC GGTAAAGTGT 1920 

TCTTTCATAA TAAATAATCA AGACATGGTC CCATTTGCAG GAAAAGTGCA GACTCTGAGT 1980 

GTTCCAGGGA AACACATGCT GGACATCCCT TGTAACCCGG TATGGGCGCC CCTGCATTGC 2040 

TGGGATGTTT CTGCCCACGG TTTTGTTTGT GCAATAACGT TATCACATTT CTAATGAGGA 2100 

TTCACATTAA TATAATATAA AATAAATAGG TCAGTTACTG GTCTCTTTCT GCCGAATGTT 2160 
ATGTTTTGCT TTTATCTCAC AGTAAAATAA ATATAATTAA AAA 

Seq ID NO: 290 Protein sequence:
Protein Accession #: NP_004228

1 11 21 31 41 51 

I I I I I I 

MDEAVGDLKQ ALPCVAESPT VHVEVHQRGS STAKKEDINL SVRKLLNRHN IVFGDYTWTE 60 

FDEPFLTRNV QSVSIIDTEL KVKDSQPIDL SACTVALHIF QLNEDGPSSE NLEEETENII 120

AANHWVLPAA EFHGLWDSLV YDVEVKSHLL DYVMTTLLFS DKNVNSNLIT WNRVVLLHGP 180

PGTGKTSLCK ALAQKLTIRL SSRYRYGQLI EINSHSLFSK WFSESGKLVT KMFQKIQDLI 240

DDKDALVFVL IDEVESLTAA RNACRAGTEP SDAIRVVNAV LTQIDQIKRH SNVVILTTSN 300

ITEKIDVAFV DRADIKQYIG PPSAAAIFKI YLSCLEELMK CQIIYPRQQL LTLRELEMIG 360

FIENNVSKLS LLLNDISRKS EGLSGRVLRK LPFLAHALYV QAPTVTIEGF LQALSLAVDK 420
QFEERKKLAA YI 

Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession #: NM_002658.1

Coding sequence: 77-1372 

1 11 21 31 41 51 

I I I I I I

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACGTGCTATG AGGGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC TUU^CGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 
ATCAATAAAA TGTGATTTTT CTGA

Seq ID NO: 292 Protein sequence:
Protein Accession #: NP_002649.1

1 11 21 31 41 51

I I I I I I

MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60
HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120
YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180
GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240
RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300
PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360
CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420
SHTKEENGLA L

Seq ID NO: 293 DNA sequence
Nucleic Acid Accession #: NM_001498
Coding sequence: 93..2006

1 11 21 31 41 51

I I I I I I

GGCACGAGGC TGAGTGTCCG TCTCGCGCCC GGAAGCGGGC GACCGCCGTC AGCCCGGAGG 60
AGGAGGAGGA GGAGGAGGAG GAGGGGGCGG CCATGGGGCT GCTGTCCCAG GGCTCGCCGC 120
TGAGCTGGGA GGAAACCAAG CGCCATGCCG ACCACGTGCG GCGGCACGGG ATCCTCCAGT 180
TCCTGCACAT CTACCACGCC GTCAAGGACC GGCACAAGGA CGTTCTCAAG TGGGGCGATG 240
AGGTGGAATA CATGTTGGTA TCTTTTGATC ATGAAAATAA AAAAGTCCGG TTGGTCCTGT 300
CTGGGGAGAA AGTTCTTGAA ACTCTGCAAG AGAAGGGGGA AAGGACAAAC CCAAACCATC 360
CTACCCTTTG GAGACCAGAG TATGGGAGTT ACATGATTGA AGGGACACCA GGACAGCCCT 420
ACGGAGGAAC AATGTCCGAG TTCAATACAG TTGAGGCCAA CATGCGAAAA CGCCGGAAGG 480
AGGCTACTTC TATATTAGAA GAAAATCAGG CTCTTTGCAC AATAACTTCA TTTCCCAGAT 540
TAGGCTGTCC TGGGTTCACA CTGCCCGAGG TCAAACCCAA CCCAGTGGAA GGAGGAGCTT 600
CCAAGTCCCT CTTCTTTCCA GATGAAGCAA TAAACAAGCA CCCTCGCTTC AGTACCTTAA 660
CAAGAAATAT CCGACATAGG AGAGGAGAAA AGGTTGTCAT CAATGTACCA ATATTTAAGG 720
ACAAGAATAC ACCATCTCCA TTTATAGAAA CATTTACTGA GGATGATGAA GCTTCAAGGG 780
CTTCTAAGCC GGATCATATT TACATGGATG CCATGGGATT TGGAATGGGC AATTGCTGTC 840
TCCAGGTGAC ATTCCAAGCC TGCAGTATAT CTGAGGCCAG ATACCTTTAT GATCAGTTGG 900
CTACTATCTG TCCAATTGTT ATGGCTTTGA GTGCTGCATC TCCCTTTTAC CGAGGCTATG 960
TGTCAGACAT TGATTGTCGC TGGGGAGTGA TTTCTGCATC TGTAGATGAT AGAACTCGGG 1020
AGGAGCGAGG ACTGGAGCCA TTGAAGAACA ATAACTATAG GATCAGTAAA TCCCGATATG 1080
ACTCAATAGA CAGCTATTTA TCTAAGTGTG GTGAGAAATA TAATGACATC GACTTGACGA 1140
TAGATAAAGA GATCTACGAA CAGCTGTTGC AGGAAGGCAT TGATCATCTC CTGGCCCAGC 1200
ATGTTGCTCA TCTCTTTATT AGAGACCCAC TGACACTGTT TGAAGAGAAA ATACACCTGG 1260
ATGATGCTAA TGAGTCTGAC CATTTTGAGA ATATTCAGTC CACAAATTGG CAGACAATGA 1320
GATTTAAGCC CCCTCCTCCA AACTCAGACA TTGGATGGAG AGTAGAATTT CGACCCATGG 1380
AGGTGCAATT AACAGACTTT GAGAACTCTG CCTATGTGGT GTTTGTGGTA CTGCTCACCA 1440
GAGTGATCCT TTCCTACAAA TTGGATTTTC TCATTCCACT GTCAAAGGTT GATGAGAACA 1500
TGAAGGTAGC ACAGAAAAGA GATGCTGTCT TGCAGGGAAT GTTTTATTTC AGGAAAGATA 1560
TTTGCAAAGG TGGCAATGCA GTGGTGGATG GTTGTGGCAA GGCCCAGAAC AGCACGGAGC 1620
TCGCTGCAGA GGAGTACACC CTCATGAGCA TAGACACCAT CATCAATGGG AAGGAAGGTG 1680
TGTTTCCTGG ACTGATCCCA ATTCTGAACT CTTACCTTGA AAACATGGAA GTGGATGTGG 1740
ACACCAGATG TAGTATTCTG AACTACCTAA AGCTAATTAA GAAGAGAGCA TCTGGAGAAC 1800
TAATGACAGT TGCCAGATGG ATGAGGGAGT TTATCGCAAA CCATCCTGAC TACAAGCAAG 1860
ACAGTGTCAT AACTGATGAA ATGAATTATA GCCTTATTTT GAAGTGTAAC CAAATTGCAA 1920
ATGAATTATG TGAATGCCCA GAGTTACTTG GATCAGCATT TAGGAAAGTA AAATATAGTG 1980
GAAGTAAAAC TGACTCATCC AACTAGACAT TCTACAGAAA GAAAAATGCA TTATTGACGA 2040
ACTGGCTACA GTACCATGCC TCTCAGCCCG TGTGTATAAT ATGAAGACCA AATGATAGAA 2100
CTGTACTGTT TTCTGGGCCA GTGAGCCAGA AATTGATTAA GGCTTTCTTT GGTAGGTAAA 2160
TCTAGAGTTT ATACAGTGTA CATGTACATA GTAAAGTATT TTTGATTAAC AATGTATTTT 2220
AATAACATAT CTAAAGTCAT CATGAACTGG CTTGTACATT TTTAAATTCT TACTCTGGAG 2280
CAACCTACTG TCTAAGCAGT TTTGTATWVTQ TACTGGTAAT TGTACAATAC TTGCATTCCA 2340
GAGTTAAAAT GTTTACTGTA AATTTTTGTT CTTTTAAAGA CTACCTGGGA CCTGATTTAT 2400
TGAAATTTTT CTCTTTAAAA ACATTTTCTC TCGTTAATTT TCCTTTGTCA TTTCCTTTGT 2460
TGTCTACATT AAATCACTTG AATCCATTGA AAGTGCTTCA AGGGTAATCT TGGGTTTCTA 2520
GCACCTTATC TATGATGTTT CTTTTGCAAT TGGAATAATC ACTTGGTCAC CTTGCCCCAA 2580
GCTTTCCCCT CTGAATATU^T ACCCATTGAA CTCTGAAAAA AAAAAAAAAA AAAA

Seq ID NO: 294 Protein sequence:
Protein Accession #: NP_001489

1 11 21 31 41 51

I I I I I I

MGLLSQGSPL SWEETKRHAD HVRRHGILQF LHIYHAVKDR HKDVLKWGDE VEYMLVSFDH 60
ENKKVRLVLS GEKVLETLQE KGERTNPNHP TLWRPEYGSY MIEGTPGQPY GGTMSEFNTV 120
EANMRKRRKE ATSILEENQA LCTITSFPRL GCPGFTLPEV KPNPVEGGAS KSLFFPDEAI 180
NKHPRFSTLT RNIRHRRGEK VVINVPIFKD KNTPSPFIET FTEDDEASRA SKPDHIYMDA 240
MGFGMGNCCL QVTFQACSIS EARYLYDQLA TICPIVMALS AASPFYRGYV SDIDCRWGVI 300
SASVDDRTRE ERGLEPLKNN NYRISKSRYD SIDSYLSKCG EKYNDIDLTI DKEIYEQLLQ 360
EGIDHLLAQH VAHLFIRDPL TLFEEKIHLD DANESDHFEN IQSTNWQTMR FKPPPPNSDI 420
GWRVEFRPME VQLTDFENSA YVVFVVLLTR VILSYKLDFL IPLSKVDENM KVAQKRDAVD 480
QGMFYFRKDI CKGGNAVVDG CGKAQNSTEL AAEEYTLMSI DTIINGKEGV FPGLIPILNS 540
YLENMEVDVD TRCSILNYLK LIKKRASGEL MTVARWMREF IANHPDYKQD SVITDEMNYS 600
LILKCNQIAN ELCECPELLG SAFRKVKYSG SKTDSSN

Seq ID NO: 295 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-816 



1 11 21 31 41 51

I I I I I I

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAAGCTT
ATGACAGGAC ATGCTATTCC ACCCAGCCAA TTGGATTCTC AGATTGATGA CTTCACTGGT
TTCAGCAAAG ATAGGATGAT GCAGAAACCT GGTAGCAATG CACCTGTGGG AGGAAACGTT
ACCAGCAGTT TCTCTGGAGA TGACCTAGAA TGCAGAGAAA CAGCCTCCTC TCCCAAAAGC
CAACGAGAAA TTAATGCTGA TATAAAACGT AAATTAGTGA AGGAACTCCG ATGCGTTGGA
CAA7WVTATG AAAA/UVTCTT CGAAATGCTT GAAGGAGTGC AAGGACCTAC TGCAGTCAGG
AAGCGATTTT TTGAATCCAT CATCAAGGAA GCAGCAAGAT GTATGAGACG AGACTTTGTT
AAGCACCTTA AGAAGAAACT GAAACGTATG ATTTGAGAAT ACTTGTCCCT GGAGGATTAT
CACACCCCAA ATGCATAATC TCGTTAATGA TTGAGGAGAG AAAAGGATCA GATTGCTGTT
TTCTACAATG GAGCAGGATA TTGCTGAAGT CTCCTGGCAT ATGTTACCGA ATCAAATAGC
CTTCCAGAGG CTAAGAAATT TCTGTTAGTA AAAGATGTTC TTTTTCCCAA AGCATTTTAT
TTGAAAGGAT AACTTGTGTT TTGGTTATTT TGTATTCCCA CCTGTGCTGG TAGATATTAT
TAACCCATTA GGTAAATACT ATTACAGTCG TGGTTTCTGC A



Seq ID NO: 296 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQR
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH
LKKKLKRMI



Seq ID NO: 297 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-815 



1 11 21 31 41 51 

I I I I I I 

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAAGCTT 
ATGACAGGAC ATGCTATTCC ACCCAGCCAA TTGGATTCTC AGATTGATGA CTTCACTGGT 
TTCAGCAAAG ATAGGATGAT GCAGAAACCT GGTAGCAATG CACCTGTGGG AGGAAACGTT 
ACCAGCAGTT TCTCTGGAGA TGACCTAGAA TGCAGAGAAA CAGCCTCCTC TCCCAAAAGC 
CAACAAGAAA TTAATGCTGA TATAAAACGT AAATTAGTGA AGGAACTCCG ATGCGTTGGA
CAAAAATATG AAAAAATCTT CGAAATGCTT GAAGGAGTGC AAGGACCTAC TGCAGTCAGG
AAACGATTTT TTGAATCCAT CATCAAGGAA GCAGCAAGAT GTATGAGACG AGACTTTGTT 
AAGCACCTTA AGAAGAAACT GAAACGTATG ATTTGAGAAT ACTTGTCCCT GGAGGATTAT 
CACACCCCAA ATGCATAATC TCATTAATGA TTGAGGAGAG AAAAGGATCA GATTGCTGTT 
TTCTACAATG GAGCAGGATA TTGCTGAAGT CTCCTGGCAT ATGTTACCGA ATCAACTGGC 
CTTCCAGAGG CTAAGAAATT TCTGTTAGTA AAAGATGTTC TTTTTCCCAA AGCGTTTTAT 
TTGAAAGGAT AACTTGTGTT TTGGTTATTT TGTATTCCCA CCTGTGCTGG TAGATATTAT 
TAACCCATTA GGTAAATACT ATTACAGTCG TGGTTTCTGC A 



Seq ID NO: 298 Protein sequence:
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I I 1 I I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 



Seq ID NO: 299 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815

1 11 21 31 41 51
| | | | | |
AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGC AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA 540
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC 660
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA 720
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA 780
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC 840
ACACCCCAAA TGCATAATCT CATTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT 900
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC 960
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA GCGTTTTATT 1020
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT 1080
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA

Seq ID NO: 300 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI



Seq ID NO: 301 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-812

1 11 21 31 41 51
| | | | | |
AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TGGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAATGTTA 540
CCAGCAATTT CTCTGGAGAT GACCTAGAAT GCAGAGGAAT AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAATGTC AAGTAGTGAA GGAAATCCGA TGCCTTGGAC 660
AATATGAAAA AATCTTCGAA ATGCTTGAAG GAGTGCAAGG ACCTACTGCA GTCAGGAAAC 720
GATTTTTTGA ATCCATCATC AAGGAAGCAG CAAGATGTAT GAGACGAGAC TTTGTTAAGC 780
ACCTTAAGAA GAAACTGAAA CGTATGATTT GAGAATACTT GTCCCTGGAG GATTATCACA 840
CCCCAAATGC ATAATCTCAT TAATGATTGA GGAGAGAAAA GGATCAGATT GCTGTTTTCT 900
ACAATGGAGC AGGATATTGC TGAAGTCTCC TGGCATATGT TACCGAATCA ACTGGCCTTC 960
CAGAGGCTAA GAAATTTCTG TTAGTAAAAG ATGTTCTTTT TCCCAAAGCG TTTTATTTGA 1020
AAGGATAACT TGTGTTTTGG TTATTTTGTA TTCCCACCTG TGCTGGTAGA TATTATTAAC 1080
CCATTAGGTA AATACTATTA CAGTCGTGGT TTCTGCA

Seq ID NO: 302 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDGMMQKPGS NAPVGGNVTS NFSGDDLECR GIASSPKSQQ 120
EINADIKCQV VKEIRCLGQY EKIFEMLEGV QGPTAVRKRF FESIIKEAAR CMRRDFVKHL 180
KKKLKRMI



Seq ID NO: 303 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815

1 11 21 31 41 51
| | | | | |
AGTGTTCGGC TGGGACAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTTCC ATCCCCCTTG 60
GGCCAAACAG GATCGGTGCT TCTGGTGAGA CGTCTCCCCA TGCACATCAC TCCCAGATGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGC AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA 540
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC 660
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA 720
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA 780
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC 840
ACACCCCAAA TGCATAATCT CGTTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT 900
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC 960
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA GCGTTTTATT 1020
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT 1080
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA

Seq ID NO: 304 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI



Seq ID NO: 305 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 87-689



1 

1 

CGTG6AG6CA 
CCAGACTAGC 
AGATGTCCGC 
CAGAGGTCCC 
TGTCCGGGAA 
ATCGGGAAAT 
CTCCCAAAAG 
JUVTCCACJWA 
ATAATTTAAA 
AGTATGAGAA 
CTGCTAAAGT 
AGGAGGAGGA 
TTAGAGTAGG 
ATTAGGTTTA 
AATTGTCAGT 
AAGTTGTACA 
CTGTGCACTT 
ATTTGTAAGG 
TATCTATAGT 
GCGTTGAGGC 
GAGGCTGGAC 
GTATATAGTG 
CATGAGAATA 
TATAGAACTC 
CTCCTGTACT 
GAAATGTTTT 
AGTCAATTTC 
CTCCCTATAA 
TGAAGGAGAG 
ATGAAGTCTG 
AGGAAGGTGG 
CCTATTTTGT 
AAATTAAGGC 
ACATTATTTG 
AGTGAGGGTA 
TTGGAAACAC 
AGGGCAGGCT 
GCCTGCTCAT 
GAAGGAGCTT 
GTTGGGGTGA 
CTGATGTGTA 
GTGAGTGTTG 
CGCAGGAGTG 
GTTGAGAAAC 
TACGAGTTAT 
CATCAGAACT 
AAAACTCGGT 
GCCATGTCCT 
CTAGGCCAAG 
GGTCAA/^GG 
CAGGGAAGGG 
GTGGGGGAGC 
TGTCAAGTTG 
CACACTGTGG 
AAGTGGCATG 
TTCTTAAGAT 
CTACTCCCTC 
AGGCTGTGAT 
GTATTTGGGG 



11 
I 

GCTAGCGCGA 
GAACAATACA 
TTATGCCTTC 
TGTCAATTTT 
AGAGAAATCT 
GAAGGATTAT 
GCCACCGTCT 
CCCCGGCATC 
TGACAGTGAA 
GGATGTTGCT 
TGCCCGGAAA 
6GAGGAGGAG 
GGAGCGCCGT 
ATTACAAAAT 
GGTTTACATG 
TATTTCCAAA 
TGCTGTTGGT 
TGGTGGTAAC 
TTGTAAAAAG 
TGTGGGGAAG 
CTGTTGACTC 
ACATAGCATT 

TTCATTGTCA 
TAAAC7VCGAT 
TGAAGTTAAA 
TGACTCACAG 
ATGTGGTAGC 
GGCTACTTGA 
GAGGAGTTAG 
GTGATTAGGA 
GGGGCCAAAT 
CTTATTGTTT 
TGGTGCCCAA 
TGTGGGATGG 
CAAACACCCC 
AATGGAATCA 
AAGTTTAGCT 
GGTTTGTGTG 
GGGGAGATGG 
TATACATCAT 
CTATTGCCCA 
TTTTTGTGCT 
TTGCATGTCT 
GGTCACGGTC 
GTTTGTCCTG 
TGTGAGGTTT 
TGTCACTTGG 
ATTCGGQAGC 
GAGTGATTTG 
CCAAGGATGG 
AGTTTAGCCA 
AGGCCACTTG 
CAAGATTGCT 
ATGTTACCTA 
GCCAACCTGT 
TAACCACCTC 
GTAGTGACTA 
ACGTTGGATG 



21 

1 

G6CTGGGGAG 
GTCAGGATGG 
TTTGTGCAGA 
GCGGAATTTT 
AAATTTGATG 
GGACCAGCTA 
GGATTCTTCC 
TCTATTGGAG 
AAGCAGCCTT 
GACTATAAGT 
AAGGTGGAAG 
GATGAATAAA 
AATTGACACA 
TTGATCACGA 
T^GTGGCCAT 
CATTTTTAAA 
GTGACAAGGC 
TATGGTTATT 
AACAAAACAA 
ATGCCTTTTG 
TGCAGGGGGC 
CTGCTGCC^^T 
TAAGTGCGGT 
GCAAAGCAAA 
TOGCAACGTT 
TAAACAGTAT 
CAGTGAACAA 
TTCTTTTATT 
AGCTACTGTG 
GAGAACGACA 
CTGAGGCTAT 
GCATTGCTAA 
TTCTCTTTCA 
CATTTGGGGT 
GGTGGTGGGG 
AAGGAAGATG 
ACCATTTCTG 
CATTCACTGG 
TCAGTGGTTA 
CCACAGTAGC 
TACTGTCCGT 
GCATTAATAT 
ATTAATTTTA 
GGAGGCGGTG 
ACAGCCTGAT 
AATGTGTTCC 
GCCCAGAGGC 
CATTCTAAGC 
TGTTGCCAGC 
TTAAGTGGTG 
AAAGGGGTAA 
GATGATCTTT 
GTCCATTAGC 
CTTCTAGTGG 
AGGCTTAGGC 
TGCTTTTTTT 
ACCCCATTCT 
TTGTCTGTGT 
CATTCATTTT 



31 
i 

CGCTGAGCCG 
CTAAAGGTGA 
CATGCAGAGA 
CCAAGAAGTG 
AAATGGCAAA 
AGGGAGGCAA 
TGTTCTGTTC 
ACGTGGCAAA 
ACATCACTAA 
CGAAAGGAAA 
AGGAAGATGA 
G7UUVCTGTTT 
TCTCTTATTT 
TCATATTGTA 
GGGTGTCTGG 
ATGAAAAGGC 
ATTTAAAGAT 
GGCTAGAAAT 
CCGAGACAAA 
QGAGAGGCTG 
ATCCATTTAG 
CTTAGCTGTG 
AGTTTTTAAA 
GAGTCACTGC 
CTGTTATTTT 
TACATTTTTA 
ACCCCCACTC 
ACTCAGTGGC 
TGATTTTGTT 
TAGGCAAGGT 
CTAGGTTTAA 
ACAGCAATTT 
CCCCTACCCC 
CTTGAGCCTG 
TAGGGGACGG 
ATAGGCTCCA 
AGCACTAAAT 
AAATGTAGAT 
TATTAGTGGG 
AAGTGGTGAC 
AGCAATGAAG 
TTGGGTGTGT 
AGAGAAAGCA 
TCCTCTCCGC 
CTCTTATGTG 
TCTAGTTCTA 
ACTTGTTCCA 
TAAAGCTTTA 
CTCGTCAAAT 
CGCGTCTATC 
CTTTTGTGCT 
GATTAGGCAA 
TGGGGCAGCA 
AATAATGCCC 
TTAGCTTGAT 
TTTTTTTTCC 
TGAATGACAT 
CTCCTGTGTG 
CTGTAATAAA 



41 
1 

CGCGTCGTGC 
CCCCAAGAAA 
AGT^CATAAG 
CTCTGAGAGG 
GGCAGATAAA 
GAAGAAGAAG 
AGAATTCCGC 
AAAGCTGGGT 
GGCGGCAAAG 
GTTTGATGGT 
AGAAGAGGAG 
ATCTGTCTCC 
GAGAAGTGTC 
GTCTCTCAAA 
AGCACCCTGA 
ACTCTCGTGT 
GTTTCTGGCA 
CCTGAGTTTT 
CCCTTGATGC 
TAGCTCAGGG 
CTTCAGGTTG 
GACAAAGGGG 
CTGTTTGTTT 
ATCAATGAAA 
TTTTGTATGT 
AAACTCTTCT 
CATTGTATTT 
CAGCTCACTT 
TGTGTCTGAG 
TCAGCAGCCT 
CTTTTGTCCC 
CAGAGTGTAT 
CCGTGCTCCT 
CTGCTGGTCT 
TATCCTTTTT 
TCTTGGGCCA 
GTATCATGAA 
TGATGTTCAA 
TAGTGTAACA 
ACTAAATACC 
GATACAGTAC 
ATGTTTGAGG 

CCTGTCGGGT 
TTCATAGCCA 
GAAAATGACC 
GAATTTCCCC 
GCTTCCCAAT 
ATGGAAGAGA 
TCATAACTAG 
TCCAAAGTAG 
ACATTGAGTT 
AGATCACTAC 
TAGTTTCTCT 
TTCTGGGCCC 
CCCATTTAAA 
TTTATCCTTC 
TGTCTGTTCT 
G 



51 

1 

CCTGCGCTGC 
CCAAAGGGCA 
AAGAAAAACC 
TGGAAGACGA 
GTGCGCTATG 
GATCCTAATG 
CCCAAGATCA 
GAGATGTGGA 
CTGAAGGAGA 
GCAAAGGGTC 
GAGGAAGAAG 
TTGTGAATAC 
TGTTGCCCTC 
GTGCTCTAGA 
AACTGTATCA 
TCTCCTCACT 
TTTTCTTTTT 
CAACTGTATA 
TCCTTGCTCG 
CGTGCACTGT 
TCTTGTTTCT 
GGTCAGCTGG 
TTAAACAAAC 
GTTCAAGAAC 
TTAGAATGCT 
CTATTATAAC 
GGAGACTGGC 
AGGGCTGAGA 
TGGCATTCAG 
TCCAAGQTAT 
ACCTCCACCC 
GGTGTGTCAA 
GGCACATATC 
CCTGGATGCC 
TTGCTCCTAC 
CCTGAGCTAT 
AAGTTGAATG 
TGTTAAACTG 
TTTTATCCAG 
ATTTTGAAGG 
TGTGTTGTGG 
CTATGAAACA 
AAAATTCACT 
CCTGGATGAG 
TTCGCTCTCC 
ACTAATTTAA 
TCCTGCTTCA 
TCGTGATGTG 
AACAACCTGC 
ATGTACCAAC 
CTAAGCAGAA 
TTAAAGAGGC 
TCAACGTTTT 
GAGATGATGT 
ACTGTCTGTG 
AGGATAGTAC 
GGAAAGAACA 
TGTCACAAAT 



60 
120 

180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 306 Protein sequence: 
Protein Accession #: NP_005333.1

1 11 21 31 41 51
| | | | | |
MAKGDPKKPK GKMSAYAFFV QTCREEHKKK NPEVPVNFAE FSKKCSEHWK TMSGKEKSKF 60
DEMAKADKVR YDREMKDYGP AKGGKKKKDP NAPKRPPSGF FLFCSEFRPK IKSTNPGISI 120
GDVAKKLGEM WNNLNDSEKQ PYITKAAKLK EKYEKDVADY KSKGKFDGAK GPAKVARKKV 180



Seq ID NO: 307 DNA sequence
Nucleic Acid Accession #: NM_022342
Coding sequence: 1..2178

1 11 21 31 41 51
| | | | | |



ATGGGTACTA GGAAAAAAGT TCATGCATTT GTCCGTGTCA AACCCACCGA TGACTTTGCT 60 

CATGAAATGA TCAGATACGG AGATGACAAA AGAAGCATTG ATATTCACTT AAAAAAAGAC 120 

ATTCGGAGAG GAGTTGTCAA TAACCAACAG ACAGACTGGT CGTTTAAGTT GGATGGAGTT 180 

TTCACGATG CCTCCCAGGA CTTGGTTTAT GAGACAGTTG CAAAGGATGT GGTTTCTCAG 240 

CCCTCGATG GCTATAATGG CACCATCATG TGTTATGGGC AGACGGGAGC TGGCAAGACA 300 

ACACCATGA TGGGGGCAAC TGAGAATTAC AAGCACCGGG GGATCCTCCC TCGTGCCCTG 360

AGCAGGTTT TTAGGATGAT CGAAGAACGC CCCACACATG CCATCACTGT GCGTGTTTCC 420

ACTTGGAAA TCTATAATGA GAGCCTGTTT GATCTCCTGT CCACTCTGCC CTATGTTGGA 480

CCTCAGTCA CACCAATGAC CATCGTGGAA AACCCTCAAG GAGTCTTCAT TAAGGGCTTG 540 

CAGTTCACC TCACAAGTCA GGAGGAGGAT GCATTCAGCC TCCTTTTTGA GGGTGAGACC 600 

ACAGGATTA TAGCCTCCCA CACTATGAAC AAAAACTCTT CCAGATCACA CTGCATTTTC 660 

CCATCTACT TAGAGGCCCA TTCCCGGACC TTATCAGAGG AAAAGTACAT CACTTCCAAA 720 

TTAACTTGG TGGATCTGGC AGGCTCAGAG AGGCTGGGGA AGTCTGGGTC TGAGGGCCAA 780 

TCCTGAAGG AAGCCACCTA CATCAACAAA TCGCTCTCAT TCCTGGAGCA GGCCATCATT 840

CCCTTGGGG ACCAGAAGCG GGACCACATC CCCTTTCGGC AGTGCAAGCT CACCCACGCT 900 

TGAAGGACT CGTTAGGGGG AAACTGCAAT ATGGTCCTCG TGACAAACAT CTATGGAGAA 960 

CTGCCCAGT TAGAAGAAAC GCTATCTTCA CTGAGATTTG CCAGCAGGAT GAAGCTAGTC 1020

CCACTGAGC CTGCCATCAA TGAAAAGTAT GATGCTGAGA GAATGGTCAA GAACCTGGAG 1080 

AGGAACTAG CACTACTCAA GCAGGAGCTG GCTATCCATG ACAGCCTGAC CAACCGCACC 1140 

TTGTGACCT ATGACCCCAT GGATGAAATC CAGATTGCTG AGATCAACTC CCAGGTGCGG 1200

GGTACCTGG AGGGGACACT GGACGAGATC GACATAATCA GCCTTAGACA GATCAAGGAG 1260

TGTTCAACC AGTTCCGGGT GGTTCTGAGC CAACAGGAAC AGGAAGTGGA GTCCACTTTG 1320 

GCAGGAAGT ACACCCTCAT TGACAGGAAT GACTTTGCAG CCATTTCTGC TATCCAGAAG 1380 

CGGGGCTTG TGGATGTTGA TGGCCACCTA GTGGGTGAGC CTGAAGGACA AAACTTTGGA 1440

TCGGAGTCG CCCCTTTCTC TACCAAACCT GGGAAGAAAG CCAAGTCCAA GAAGACATTC 1500 

AAGAGCCAC TCAGGCCCGA CACCCCACCC TCCAAACCAG TGGCCTTTGA GGAGTTTAAG 1560 

ATGAGCAAG GTAGTGAGAT CAAGCGAATT TTCAAAGAAA ACAAATCCAT CTTGAATGAA 1620 

GGAGGAAAA GGGCCAGCGA GACCACACAG CACATCAATG CCATCAAGCG GGAGATTGAT 1680 

TGACCAAGG AGGCCCTGAA TTTCCAGAAG TCACTACGGG AGAAGCAAGG CAAGTACGAA 1740 

ACAAGGGGC TGATGATCAT CGATGAGGAA GAATTCCTGC TGATCCTCAA GCTCAAAGAC 1800 

TCAAGAAGC AGTACCGCAG CGAGTACCAG GACCTGCGTG ACCTCAGGGC TGAGATCCAG 1860

ATTGCCAGC ACCTAGTGGA TCAGTGTCGC CACCGCCTGC TCATGGAATT TGACATCTGG 1920 

ACAATGAGT CCTTTGTCAT CCCTGAGGAC ATGCAGATGG CACTGAAGCC AGGCGGCAGC 1980 

TCCGGCCAG GCATGGTCCC TGTGAACAGG ATTGTGTCTC TGGGAGAAGA TGACCAGGAC 2040 

AATTCAGCC AGCTGCAGCA GAGGGTGCTT CCTGAGGGCC CTGATTCCAT CTCCTTCTAC 2100 

ATGCCAAAG TCAAGATAGA GCAGAAGCAT AATTACTTGA AAACCATGAT GGGCCTCCAG 2160
AGGCACATA GAAAATAG 



Seq ID NO: 308 Protein sequence: 
Protein Accession #: NP_071737



1 11 21 31 41 51 

I I I I I I 

MGTRKKVHAF VRVKPTDDFA HEMIRYGDDK RSIDIHLKKD IRRGVVNNQQ TDWSFKLDGV 60

LHDASQDLVY ETVAKDVVSQ ALDGYNGTIM CYGQTGAGKT YTMMGATENY KHRGILPRAL 120

QQVFRMIEER PTHAITVRVS YLEIYNESLF DLLSTLPYVG PSVTPMTIVE NPQGVFIKGL 180

SVHLTSQEED AFSLLFEGET NRIIASHTMN KNSSRSHCIF TIYLEAHSRT LSEEKYITSK 240

INLVDLAGSE RLGKSGSEGQ VLKEATYINK SLSFLEQAII ALGDQKRDHI PFRQCKLTHA 300

LKDSLGGNCN MVLVTNIYGE AAQLEETLSS LRFASRMKLV TTEPAINEKY DAERMVKNLE 360

KELALLKQEL AIHDSLTNRT FVTYDPMDEI QIAEINSQVR RYLEGTLDEI DIISLRQIKE 420

VFNQFRVVLS QQEQEVESTL RRKYTLIDRN DFAAISAIQK AGLVDVDGHL VGEPEGQNFG 480

LGVAPFSTKP GKKAKSKKTF KEPLRPDTPP SKPVAFEEFK NEQGSEINRI FKENKSILNE 540

RRKRASETTQ HINAIKREID VTKEALNFQK SLREKQGKYE NKGLMIIDEE EFLLILKLKD 600

LKKQYRSEYQ DLRDLRAEIQ YCQHLVDQCR HRLLMEFDIW YNESFVIPED MQMALKPGGS 660

IRPGMVPVNR IVSLGEDDQD KFSQLQQRVL PEGPDSISFY NAKVKIEQKH NYLKTMMGLQ 720
QAHRK



Seq ID NO: 309 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51 

I I I I I I 

.p.j,^^^^.j^^.p TTTTTTTTAA TGCCTGCTGT CATGCTCTGT CTACCAGGGT GAATTTCCAA 60 

AAATTTCTGC ATAGCAATTT TAGCCAAAAC TATATATGTT CTGGGGAGG^ TAGGCATAGG 120 

CACATTGAAG ACCAAAGGAA AGAGTGAAGA AGTGTAGTTG GGTCATTGTG AATGGATGTT 180 

TAGATTGTCA AGAAAAGTGG GCCAGAGGCC CCACCTCACA CTAGGACGGC AATTGCCTCT 240 

CATTAGTATC TCAGGCACCA TGGGTCTTAT TTGGTGTCAT AAGAAACACC CTCAACAAAG 300 

TAATGAACCC TCAGCCTCCA GCTTCTCTTC TTCGGGATTC TTCTTAGGGC CTCCTTTTTC 360 

CTTTTATGTT TCCAGTACCC TGAATTTCTT ATTCCCATCC CCCATTAAAA TCTGCTTCAA 420 

AGAAAAAACA AGAAGGACAC ATTCACTTTA AGATCCAAAT GAATGATAAG AGCTTAAAAC 480 
ATTATACTTA TCAGTATTAT TTGCATTTTT ATAGAAACCA AAACCATATT TCAACAAC 



Seq ID NO: 310 DNA sequence 

Nucleic Acid Accession #: NM_018622.2 

Coding sequence: 1-1140

1 11 21 31 41 51 

I I I I I i 

ATGGCGTGGC GAGGCTGGGC GCAGAGAGGC TGGGGCTGCG GCCAGGCGTG GGGTGCGTCG 60 
GTGGGCGGCC GCAGCTGCGA GGAGCTCACT GCGGTCCTAA CCCCGCCGCA GCTCCTCGGA 120 
CGCAGGTTTA ACTTCTTTAT TCAACAAAAA TGCGGATTCA GAAAAGCACC CAGGAAGGTT 180 
GAACCTCGAA GATCAGACCC AGGGACAAGT GGTGAAGCAT ACAAGAGAAG TGCTTTGATT 240 
CCTCCTGTGG AAGAAACAGT CTTTTATCCT TCTCCCTATC CTATAAGGAG TCTCATAAAA 300 
CCTTTATTTT TTACTGTTGG GTTTACAGGC TGTGCATTTG GATCAGCTGC TATTTGGCAA 360 




TATGAATCAC TGAAATCCAG GGTCCAGAGT TATTTTGATG GTATAAAAGC TGATTGGTTG 420 

GATAGCATAA GACCACAAAA AGAAGGAGAC TTCAGAAAGG AGATTAACAA GTGGTGGAAT 480 

AACCTAAGTG ATGGCCAGCG GACTGTGACA GGTATTATAG CTGCAAATGT CCTTGTATTC 540 

TGTTTATGGA GAGTACCTTC TCTGCAGCGG ACAATGATCA GATATTTCAC ATCGAATCCA 600 

GCCTCAAAGG TCCTTTGTTC TCCAATGTTG CTGTCAACAT TCAGTCACTT CTCCTTATTT 660 

CACATGGCAG CAAATATGTA TGTTTTGTGG AGCTTCTCTT CCAGCATAGT GAACATTCTG 720

GGTCAAGAGC AGTTCATGGC AGTGTACCTA TCTGCAGGTG TTATTTCCAA TTTTGTCAGT 780 

TACCTGGGTA AAGTTGCCAC AGGAAGATAT GGACCATCAC TTGGTGCATC TGGTGCCATC 840

ATGACAGTCC TCGCAGCTGT CTGCACTAAG ATCCCAGAAG GGAGGCTTGC CATTATTTTC 900 

CTTCCGATGT TCACGTTCAC AGCAGGGAAT GCCCTGAAAG CCATTATCGC CATGGATACA 960 

GCAGGAATGA TCCTGGGATG GAAATTTTTT GATCATGCGG CACATCTTGG GGGAGCTCTT 1020 

TTTGGAATAT GGTATGTTAC TTACGGTCAT GAACTGATTT GGAAGAACAG GGAGCCGCTA 1080 
GTGAAAATCT GGCATGAAAT AAGGACTAAT GGCCCCAAAA AAGGAGGTGG CTCTAAGTAA 

Seq ID NO: 311 Protein sequence: 
Protein Accession #: NP_061092.2 

1 11 21 31 41 51 

I I I I I I 

MAWRGWAQRG WGCGQAWGAS VGGRSCEELT AVLTPPQLLG RRFNFFIQQK CGFRKAPRKV 60 

EPRRSDPGTS GEAYKRSALI PPVEETVFYP SPYPIRSLIK PLFFTVGFTG CAFGSAAIWQ 120

YESLKSRVQS YFDGIKADWL DSIRPQKEGD FRKEINKWWN NLSDGQRTVT GIIAANVLVF 180 

CLWRVPSLQR TMIRYFTSNP ASKVLCSPML LSTFSHFSLF HMAANMYVLW SFSSSIVNIL 240 

GQEQFMAVYL SAGVISNFVS YLGKVATGRY GPSLGASGAI MTVLAAVCTK IPEGRLAIIF 300

LPMFTFTAGN ALKAIIAMDT AGMILGWKFF DHAAHLGGAL FGIWYVTYGH ELIWKNREPL 360
VKIWHEIRTN GPKKGGGSK 

Seq ID NO: 312 DNA sequence
Nucleic Acid Accession #: NM_000625
Coding sequence: 195..3656

1 11 21 31 41 51

I I I I I I 

CTCTCGGCCA CCTTTGATGA GGGGACTGGG CAGTTCTAGA CAGTCCCGAA GTTCTCAAGG 60 

CACAGGTCTC TTCCTGGTTT GACTGTCCTT ACCCCGGGGA GGCAGTGCAG CCAGCTGCAA 120 

GCCCCACAGT GAAGAACATC TGAGCTCAAA TCCAGATAAG TGACATAAGT GACCTGCTTT 180 

GTAT^GCCAT AGAGATGGCC TGTCCTTGGA AATTTCTGTT CAAGACCAAA TTCCACCAGT 240 

ATGCAATGAA TGGGGAAAAA GGCATCAACA ACAATGTGGA GAAAGCCCCC TGTGCCACCT 300 

CCAGTCCAGT GACACAGGAT GACCTTCAGT ATCACAACCT CAGCAAGCAG CAGAATGAGT 360

CCCCGCAGCC CCTCGTGGAG ACGGGAAAGA AGTCTCCAGA ATCTCTGGTC AAGCTGGATG 420 

CAACCCCATT GTCCTCCCCA CGGCATGTGA GGATCAAAAA CTGGGGCAGC GGGATGACTT 480 

TCCAAGACAC ACTTCACCAT AAGGCCAAAG GGATTTTAAC TTGCAGGTCC AAATCTTGCC 540 

TGGGGTCCAT TATGACTCCC AAAAGTTTGA CCAGAGGACC CAGGGACAAG CCTACCCCTC 600 

CAGATGAGCT TCTACCTCAA GCTATCGAAT TTGTCAACCA ATATTACGGC TCCCTCAAAG 660 

AGGCAAAAAT AGAGGAACAT CTGGCCAGGG TGGAAGCGGT AACAAAGGAG ATAGAAACAA 720

CAGTAACCTA CCAACTGACG GGAGATGAGC TCATCTTCGC CAGCAAGCAG GCCTGGCGCA 780

ATGCCCCACG CTGCATTGGG AGGATCCAGT GGTCCAACCT GCAGGTCTTC GATGCCCGCA 840 

GCTGTTCCAC TGCCCGGGAA ATGTTTGAAC ACATCTGCAG ACACGTGCGT TACTCCACCA 900 

ACAATGGCAA CATCAGGTCG GCCATCACCG TGTTCCCCCA GCGGAGTGAT GGCAAGCACG 960 

ACTTCCGGGT GTGGAATGCT CAGCTCATCC GCTATGCTGG CTACCAGATG CCAGATGGCA 1020 

GCATCAGAGG GGACCCTGCC AACGTGGAAT TCACTCAGCT GTGCATCGAC CTGGGCTGGA 1080 

AGCCCAAGTA CGGCCGCTTC GATGTGGTCC CCCTGGTCCT GCAGGCCAAT GGCCGTGACC 1140 

CTGAGCTCTT CGAAATCCCA CCTGACCTTG TGCTTGAGGT GGCCATGGAA CATCCCAAAT 1200

ACGAGTGGTT TCGGGAACTG GAGCTAAAGT GGTACGCCCT GCCTGCAGTG GCCAACATGC 1260

TGCTTGAGGT GGGCGGCCTG GAGTTCCCAG GGTGCCCCTT CAATGGCTGG TACATGGGCA 1320 

CAGAGATCGG AGTCCGGGAC TTCTGTGATG TCCAGCGCTA CAACATCCTG GAGGAAGTGG 1380

GCAGGAGAAT GGGCCTGGAA ACGCACAAGC TGGGCTCGCT CTGGAAAGAC CAGGCTGTCG 1440 

TTGAGATCAA CATTGCTGTG CTCCATAGTT TCCAGAAGCA GAATGTGACC ATCATGGACC 1500

ACCACTCGGC TGCAGAATCC TTCATGAAGT ACATGCAGAA TGAATACCGG TCCCGTGGGG 1560 

GCTGCCCGGC AGACTGGATT TGGCTGGTCC CTCCCATGTC TGGGAGCATC ACCCCCGTGT 1620 

TTCACCAGGA GATGCTGAAC TACGTCCTGT CCCCTTTCTA CTACTATCAG GTAGAGGCCT 1680 

GGAAAACCCA TGTCTGGCAG GACGAGAAGC GGAGACCCAA GAGAAGAGAG ATTCCATTGA 1740 

AAGTCTTGGT CAAAGCTGTG CTCTTTGCCT GTATGCTGAT GCGCAAGACA ATGGCGTCCC 1800 

GAGTCAGAGT CACCATCCTC TTTGCGACAG AGACAGGAAA ATCAGAGGCG CTGGCCTGGG 1860 

ACCTGGGGGC CTTATTCAGC TGTGCCTTCA ACCCCAAGGT TGTCTGCATG GATAAGTACA 1920 

GGCTGAGCTG CCTGGAGGAG GAACGGCTGC TGTTGGTGGT GACCAGTACG TTTGGCAATG 1980 

GAGACTGCCC TGGCAATGGA GAGAAACTGA AGAAATCGCT CTTCATGCTG AAAGAGCTCA 2040

ACAACAAATT CAGGTACGCT GTGTTTGGCC TCGGCTCCAG CATGTACCCT CGGTTCTGCG 2100

CCTTTGCTCA TGACATTGAT CAGAAGCTGT CCCACCTGGG GGCCTCTCAG CTCACCCCGA 2160

TGGGAGAAGG GGATGAGCTC AGTGGGCAGG AGGACGCCTT CCGCAGCTGG GCCGTGCAAA 2220

CCTTCAAGGC AGCCTGTGAG ACGTTTGATG TCCGAGGCAA ACAGCACATT CAGATCCCCA 2280 

AGCTCTACAC CTCCAATGTG ACCTGGGACC CGCACCACTA CAGGCTCGTG CAGGACTCAC 2340 

AGCCTTTGGA CCTCAGCAAA GCCCTCAGCA GCATGCATGC CAAGAACGTG TTCACCATGA 2400 

GGCTCAAATC TCGGCAGAAT CTACAAAGTC CGACATCCAG CCGTGCCACC ATCCTGGTGG 2460 

AACTCTCCTG TGAGGATGGC CAAGGCCTGA ACTACCTGCC GGGGGAGCAC CTTGGGGTTT 2520 

GCCCAGGCAA CCAGCCGGCC CTGGTCCAAG GCATCCTGGA GCGAGTGGTG GATGGCCCCA 2580 

CACCCCACCA GGCAGTGCGC CTGGAGGCCC TGGATGAGAG TGGCAGCTAC TGGGTCAGTG 2640 

ACAAGAGGCT GCCCCCCTGC TCACTCAGCC AGGCCCTCAC CTACTTCCTG GACATCACCA 2700 

CACCCCCAAC CCAGCTGCTG CTCCAAAAGC TGGCCCAGGT GGCCACAGAA GAGCCTGAGA 2760

GACAGAGGCT GGAGGCCCTG TGCCAGCCCT CAGAGTACAG CAAGTGGAAG TTCACCAACA 2820

GCCCCACATT CCTGGAGGTG CTAGAGGAGT TCCCGTCCCT GCGGGTGTCT GCTGGCTTCC 2880 

TGCTTTCCCA GCTCCCCATT CTGAAGCCCA GGTTCTACTC CATCAGCTCC CCCCGGGATC 2940 

ACACGCCCAC GGAGATCCAC CTGACTGTGG CCGTGGTCAC CTACCACACC CGAGATGGCC 3000 

AGGGTCCCCT GCACCACGGC GTCTGCAGCA CATGGCTCAA CAGCCTGAAG CCCCAAGACC 3060 

CAGTGCCCTG CTTTGTGCGG AATGCCAGCG GCTTCCACCT CCCCGAGGAT CCCTCCCATC 3120 



CTTGCATCCT CATCGGGCCT GGCACAGGCA TCGCGCCCTT CCGCAGTTTC TGGCAGCAAC 3180

GGCTCCATGA CTCCCAGCAC AAGGGAGTGC GGGGAGGCCG CATGACCTTG GTGTTTGGGT 3240

GCCGCCGCCC AGATGAGGAC CACATCTACC AGGAGGAGAT GCTGGAGATG GCCCAGAAGG 3300

GGGTGCTGCA TGCGGTGCAC ACAGCCTATT CCCGCCTGCC TGGCAAGCCC AAGGTCTATG 3360

TTCAGGACAT CCTGCGGCAG CAGCTGGCCA GCGAGGTGCT CCGTGTGCTC CACAAGGAGC 3420

CAGGCCACCT CTATGTTTGC GGGGATGTGC GCATGGCCCG GGACGTGGCC CACACCCTGA 3480

AGCAGCTGGT GGCTGCCAAG CTGAAATTGA ATGAGGAGCA GGTCGAGGAC TATTTCTTTC 3540

AGCTCAAGAG CCAGAAGCGC TATCACGAAG ATATCTTTGG TGCTGTATTT CCTTACGAGG 3600

CGAAGAAGGA CAGGGTGGCG GTGCAGCCCA GCAGCCTGGA GATGTCAGCG CTCTGAGGGC 3660

CTACAGGAGG GGTTAAAGCT GCCGGCACAG AACTTAAGGA TGGAGCCAGC TCTGCATTAT 3720

CTGAGGTCAC AGGGCCTGGG GAGATGGAGG AAAGTGATAT CCCCCAGCCT CAAGTCTTAT 3780

TTCCTCAACG TTGCTCCCCA TCAAGCXCTT TACTTGACCT CCTAACAAGT AGCACCCTGG 3840
ATTGATCGGA GCCTC

Seq ID NO: 313 Protein sequence:
Protein Accession #: NP_000616

1 11 21 31 41 51
| | | | | |
MACPWKFLFK TKFHQYAMNG EKGINNNVEK APCATSSPVT QDDLQYHNLS KQQNESPQPL 60
VETGKKSPES LVKLDATPLS SPRHVRIKNW GSGMTFQDTL HHKAKGILTC RSKSCLGSIM 120
TPKSLTRGPR DKPTPPDELL PQAIEFVNQY YGSLKEAKIE EHLARVEAVT KEIETTVTYQ 180
LTGDELIFAT KQAWRNAPRC IGRIQWSNLQ VFDARSCSTA REMFEHICRH VRYSTNNGNI 240
RSAITVFPQR SDGKHDFRVW NAQLIRYAGY QMPDGSIRGD PANVEFTQLC IDLGWKPKYG 300
RFDVVPLVLQ ANGRDPELFE IPPDLVLEVA MEHPKYEWFR ELELKWYALP AVANMLLEVG 360
GLEFPGCPFN GWYMGTEIGV RDFCDVQRYN ILEEVGRRMG LETHKLASLW KDQAVVEINI 420
AVLHSFQKQN VTIMDHHSAA ESFMKYMQNE YRSRGGCPAD WIWLVPPMSG SITPVFHQEM 480
LNYVLSPFYY YQVEAWKTHV WQDEKRRPKR REIPLKVLVK AVLFACMLMR KTMASRVRVT 540
ILFATETGKS EALAWDLGAL FSCAFNPKVV CMDKYRLSCL EEERLLLVVT STFGNGDCPG 600
NGEKLKKSLF MLKELNNKFR YAVFGLGSSM YPRFCAFAHD IDQKLSHLGA SQLTPMGEGD 660
ELSGQEDAFR SWAVQTFKAA CETFDVRGKQ HIQIPKLYTS NVTWDPHHYR LVQDSQPLDL 720
SKALSSMHAK NVFTMRLKSR QNLQSPTSSR ATILVELSCE DGQGLNYLPG EHLGVCPGNQ 780
PALVQGILER VVDGPTPHQA VRLEALDESG SYWVSDKRLP PCSLSQALTY FLDITTPPTQ 840
LLLQKLAQVA TEEPERQRLE ALCQPSEYSK WKFTNSPTFL EVLEEFPSLR VSAGFLLSQL 900
PILKPRFYSI SSPRDHTPTE IHLTVAVVTY HTRDGQGPLH HGVCSTWLNS LKPQDPVPCF 960
VRNASGFHLP EDPSHPCILI GPGTGIAPFR SFWQQRLHDS QHKGVRGGRM TLVFGCRRPD 1020
EDHIYQEEML EMAQKGVLHA VHTAYSRLPG KPKVYVQDIL RQQLASEVLR VLHKEPGHLY 1080
VCGDVRMARD VAHTLKQLVA AKLKLNEEQV EDYFFQLKSQ KRYHEDIFGA VFPYEAKKDR 1140
VAVQPSSLEM SAL

Seq ID NO: 314 DNA sequence
Nucleic Acid Accession #: XM_087254
Coding sequence: 47..2332

1 11 21 31 41 51
| | | | | |
AGAGTACGTG TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTTCGGGA 60
ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 120
AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 180
CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 240
AATTAAAGAA CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300
CAATGTTCAA ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACCTGG CACCATCGCA 360
GTTGGAGTAC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420
TGGTATTGTG TTTATTGGCA ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480
GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT 540
AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600
TCTCCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660
TTTGAAAGGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720
GGAAATAGAT AAACGCATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAGAGAAATT 780
GGCAGCTGTT TTCCAGTTCA TAGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840
AGACAGACTA CAAGATAAAG TTCGAGAAAC TATTGAAGCA TTGAGAATGG CTGGTATCAA 900
AGTATGGGTA CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960
CCATTTTCAT AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020
TGCTGAACAA TTGAGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TTCAGCATGG 1080
GCTGGTAGTG GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140
GGAAGTTTGC AGAAATTGTT CAGCTGTATT ATGCTGTCGT ATGGCTCCAC TGCAGAAAGC 1200
AAAAGTAATA AGACTAATAA AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260
TGGTGCTAAT GACGTAAGCA TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320
AGAAGGAAGA CAGGCTGCAA GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCTCTC 1380
CAAATTGCTT TTTGTTCATG GTCATTTTTA TTATATTAGA ATAGCTACCC TTGTACAGTA 1440
TTTTTTTTAT AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTGTTT 1500
GTTTTCTCAG CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 1560
TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 1620
AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA TTAAAACATT 1680
TCTTTATTGG ACCATCCTGG GCTTCAGTCA TGCCTTTATT TTCTTTTTTG GATCCTATTT 1740
ACTAATAGGG AAAGATACAT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800
TGGCACTTTG GTCTTCACAG TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860
TCATTTTTGG ACTTGGATCA ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920
ATTTTCCTTG TTTTATGGAG GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT 1980
TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT GCCATAATCC TCATGGTTGT 2040
TACATGTCTA TTTCTTGATA TCATAAAGAA GGTCTTTGAC CGACACCTCC ACCCTACAAG 2100
TACTGAAAAG GCACAGCTTA CTGAAACAAA TGCAGGTATC AAGTGCTTGG ACTCCATGTG 2160
CTGTTTCCCG GAAGGAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAGTTAT 2220
AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTCTATAC 2280
CAACGACAGG AGCATCTTGA CTCTCTCCAC AATGGACTCA TCTACTTGTT AAAGGGGCAG 2340
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TAGTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400 

ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAATT 252 0 

TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2640 

TTTAATATAA ATGTAGAAAA AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760 

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGA7\AA GTTAATGTGA 2820 

ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880 

TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAG6TCCGT TCACCGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180 

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

TATGTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CTiAACTACCA 3480 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3 660 

AATTTATCAA GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGTC 3720 

TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020 

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAA7VATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA CTGACACCAG ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAATU^T 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCX5ATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 

GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCX3TAGGCT CACCCTGGGC 4980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCA7A TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGJ\AAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460 

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA A/iAATAGATC ATAACTCATG ATATGTTTGT 5640 

AATCATGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 
GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 

Seq ID NO: 315 Protein sequence: 
Protein Accession #: XP_087254 



1 11 21 31 41 51 

| | | | | | 

MQFRECSING MKYQEINGRL VPEGPTPDSS EGNLSYLSSL SHLNNLSHLT TSSSFRTSPE 60 

NETEHKEHD LFFKAVSLCH TVQISNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 120 

AAARIGIVFI GNSBETMEVK TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGEKLLFAKG 180 

AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 240 

REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRETIEALR MAGIKVWVLT GDKHETAVSV 300 

SLSCGHFHRT MNILELINQK SDSECABQLR QLARRITEDH VIQHGLWDG TSLSLALREH 360 

EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 420 

GIMGKEGRQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 480 

QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDISKNRLL 540 

SIKTFLYWTI LGFSHAFIFF FGSYLLIGKD TSLLGNGQMP GNWTFGTLVF TVMVITVTVK 600 

MALETHFWTW INHLVTWGSI IFYFVFSLFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 660 

ILMWTCLFL DIIKKVFDRH LHPTSTBKAQ LTBTNAGIKC LDSMCCPPEG EAACASVGRM 720 
LERVIGRCSP THISRSWSAS DPFYTNDRSI LTDSTMDSST C 

Seq ID NO: 316 DNA sequence 
Nucleic Acid Accession #: NM_004473 
Coding sequence: 661..1791 

1 11 21 31 41 51 

| | | | | | 

CTCGCCAGCG GTCCGCGGGG CTGGAGACCC ACGCCGTGGA GAGGACCAGC CTCAGGTCGC 60 
CCCGCCTGGG CCCGCGCCCC GACCTCGCTG CCCCCGCCTC GCCTCTCTGC CCGTGGCGCT 120 

TACCGCCACC TTGGCCTCGG GGGCAGGGCA TGGGCGGCCC CCGCCAGATC GCCCAGCX3CC 180 

AGTACTAACT GCCCTCGCTC TGGCCTTCGA GCCCX3AAGCC TCTTCTGCGC GCACAACCTA 240 

GGCAGTAATC CTAAACTAGC GGGCACCACA GACCAGCTGC AGCCACCCCA ACCCAGGGAT 300 

CACTTCCGGA CCCCTCGACC GCCCGGCACC AGCGCGCAAG GGACCCTTCA GCCGGAGACC 360 

AGAGTCCAGT CCCGGTCGCG AGGCCACCGC CGCTGCCCGC CTCGAGTU^GC ACAACGCGGG 420 

CTGAGCCGTC GGCTAGCGGG TCACTCCCGA GCCTCTGTCT GCACCGCGCC AGCCCCAGAC 480 

CACGGACGCT GAGCCTCCAG CGCGCGCCAG CCTGGGCCGC TGGGCTCTCC GGGCCAGCCC 540 

GCGACGATCC CCTGAGCTCT CCGCAGAAGG GCCGAGCGTC CGTTCCGGGG ACGCCAGGCC 600 

CGCCCCCGCC CCCCGACAGC CGCGGGGATC CAGAGCCCGG GGGTGCGGGA CGCCCGCGCC 660 

ATGACTGCCG AGAGCGGGCC GCCGCCGCCG CAGCCGGAGG TGCTGGCTAC CGTGAAGGAA 720 

GAGCGCGGCG AGACGGCAGC AGGGGCCGGG GTCCCAGGGG AGGCCACGGG CCGCGGGGCG 780 

GGCGGGCGGC GCCGCAAGCG CCCCCTGCAG CGCGGGAAGC CGCCCTACAG CTACATCGCG 840 

CTCATCGCCA TGGCCATCGC GCACGCGCCC GAGCGCCGCC TCACGCTGGG CGGCATCTAC 900 

AAGTTCATCA CCGAGCGCTT CCCCTTCTAC CGCGACAACC CCAAAAAGTG GCAGAACAGC 960 

ATCCGCCACA ACCTCACACT CAACGACTGC TTCCTCAAGA TCCCGCGCGA GGCCGGCCGC 1020 

CCGGGTAAGG GCAACTACTG GGCGCTCGAC CCCAACGCGG AGGACATGTT CGAGAGCGGC 1080 

AGCTTCCTGC GCCGCCGCAA GCGCTTCAAG CGCTCGGACC TCTCCACCTA CCCGGCTTAC 1140 

ATGCACGACG CGGCGGCTGC CGCAGCCGCC GCTGCCGCAG CCGCCGCCGC CGCCGCCGCC 1200 

GCCGCCATCT TCCCAGGCGC GGTGCCCGCC GCGCGCCCCC CCTACCCGGG CGCCGTCTAT 1260 

GCAGGCTACG CGCCGCCGTC GCTGGCCGCG CCGCCTCCAG TCTACTACCC CGCGGCGTCG 1320 

CCCGGCCCTT GCCGCGTCTT CGGCCTGGTT CCTGAGCGGC CGCTCAGCCC AGAGCTGGGG 1380 

CCCGCACCGT CGGGGCCCGG CGGCTCTTGC GCCTTTGCCT CCGCCGGCGC CCCCGCTACC 1440 

ACCACCGGCT ACCAGCCCGC AGGCTGCACC GGGGCCCGGC CGGCCAACCC CTCTGCCTAT 1500 

GCGGCTGCCT ACGCGGGCCC CGACGGCGCG TACCCGCAGG GCGCCGGCAG TGCGATCTTT 1560 

GCCGCTGCTG GCCGCCTGGC GGGACCCGCT TCGCCCCCAG CGGGCGGCAG CAGTGGCGGC 1620 

GTGGAGACCA CGGTGGACTT CTACGGGCGC ACGTCGCCCG GCCAGTTCGG AGCGCTGGGA 1680 

GCCTGCTACA ACCCTGGCGG GCAGCTCGGA GGGGCCAGTG CAGGCGCCTA CCATGCTCGC 1740 

CATGCTGCCG CTTATCCCGG TGGGATAGAT CGGTTCGTGT CCGCCATGTG AGCCAGCGTA 1800 

GGGACGAAAA CTCATAGACA CATCGGCTGT TCACACGTTC CCCGCAACCT GAGAACGAAC 1860 

AGGAATGGAG AGAGGACTCA ACTGGGACCC ACGTGGAAAA GACCGAGCAG GCCACAGAGG 1920 

CTCGGTCTCC CCGCGCACAG CGTAGGCACC CTGTGTACTC TGTA/^CGGG AGGAGGTGGG 1980 

GCGAGGCAGC CAGAGCCCTT GGACTGGCAG AGGGACCCTC GATGGAGCGA AGCCCTCAAA 2040 

CGGGATGCTT TCTGGCATTC TATCGGGGAG GGTCCTTGGC GGTAACCAGA GGGCAGCGTA 2100 

GTGTCAACAC CAGAGACCAG GATCCAAATT GTGGGGAATC AGTTTCAGCC TTCCATGTGC 2160 

TGCCGGAACT CGGGCCTTTT TACGCGGTTC GTCCTCTAGT GCCTTTAACT GCGTTACTAC 2220 

AATAAAAGGC TGCGGCAGCG CCTTTCTTCT TAAAGTGAGG AGGACAAATT TGCAAAAGAA 2280 

ATAGGCTTTT CTTCTTTTTT AAATTGGAGA AATCTCTGCT CTGGTTGACC TGGGCTGGTT 2340 

TTCCCTGTCT CTGAGAACTT GAGACCTAGC TCCGAGTTGA ACTGTGCGTC AGCACTCCAG 2400 

TCCCATCACC TGAACCTTCA GTCTCCCCCA TCTGTTACAC TAGAGGGCTG CAGGACTCTA 2460 

TCCACCGCCC CCGGGTTATC ATTCAGGGCC CCATCATCTT GGATGCTGCC CTGCGTATTT 2520 

GGCAGCAATG GTGGGCCACC CAGGGCCTCT GAGTAGCCAC CCAAAGCCTA GCCGCTGTTC 2580 

TAGGGAACGG AAAAGAGTTC ATGGCCAAGC GTCTAACCTA AAGTCCCAGG ATTGGCTCCA 2640 

GGCAGCAATT ATATCATAAC TTATTGAACT TTTGAGCAGG ACGTGCTGGT AATTTCATGG 2700 

CTGTTACTGC CCAGTCATAA ATCTGCTTTT CCATTATAAG GCAGAGAGAA GTACATTCGT 2760 

TCATTTGTCC ACTGTTTCTT GTCATCACGC AGCCCTGGAC CCAAAGGGTG AACTAAAGTT 2820 

TAAGGAGATG AGAGGATTCA AGGAGCCCGT TGGTGACGCC TTTCAGTAGC TGGGGAGGGC 2880 

TCTTCCATCC CCAGCACCCC CTGCTACACC TCAGCAGCCT CCCCCATGCA AAAAGGAAAG 2940 

AG7W\AATTA AGTTAGGGCA GTCAGTAAAG TGAGCTTTAG AAAGAAACTG GAATTTTAAC 3000 

TTCATTTTGT ATCTTGCTTA AGTAGCAGGC TCACTAAAAT TAGAGAAAGT CCAATAACTC 3060 

TCCCCCTTTC CCTTGAGAAA TCTTTAAGTT TCGATTCTGG AGCAAAAACT TTCAGCATTA 3120 

AATATTTCAG AGGCTCCATT CACAGCTTTC AGATAAACTG GAGTGTTCAG ATGGACTGTT 3180 

TTAATAAAAA TCTTTGAGCA AGTGAGTTAT GGCAAGAGAA ACTCAGCCTC TTTCTGTATA 3240 

AACTTAACAG GGAAGGGCTG GGGTGTGAAA AAGAAGATTG TATGAAAACC ATTGGTAATT 3300 

TTTATTTTTT ATTTTTGGGA CTGCACTATC CTGTTCACGA AGACATGTGA ACTTGGTTCA 3360 

GTCCAAATGG GGATTTGTAT AAACCAGTGC TCTCCATTAG AAATATGGTG CAAGCCACAT 3420 

ATGTAATTTT AAATATTCTA GTAGCCACAT TAATAAAGTN AAAAGAAACA AAAAAAAAAA 3480 
AA 

Seq ID NO: 317 Protein sequence: 
Protein Accession #: NP_004464 

1 11 21 31 41 51 

| | | | | | 

FKHLTHYROI DTRANSCRIP TIONFACTOR TTFMTAESGP PPPQPEVLAT VKEERGETAA 60 

GAGVPGEATG RGAGGRRRKR PLQRGKPPYS YIALIAMAIA HAPERRLTLG GIYKFITERF 120 

PFYRDNPKKW QNSIRHNLTL NDCFLKIPRE AGRPGKGNYW ALDPNAEDMF ESQSFDRRRK 180 

RFKRSDLSTY PAYMHDAAAA AAAAAAAAAA AAAAAIFPGA VPAARPPYPG AVYAGYAPPS 240 

LAAPPPVYYP AASPGPCRVF GLVPERPLSP ELGPAPSGPG GSCAFASAGA PATTTGYQPA 300 

GCTGARPANP SAYAAAYAGP DGAYPQGAGS AIFAAAGRLA GPASPPAGGS SGGVETTVDF 360 
YGRTSPGQFG ALGACYNPGG QLGGASAGAY HARHAAAYPG GIDRFVSAM 

Seq ID NO: 318 DNA sequence 
Nucleic Acid Accession #: NM_005688 
Coding sequence: 126..4439 

1 11 21 31 41 51 

| | | | | | 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTG7UVGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA J^AAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAi\AATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACTWV CATCCTGTTT GGG7\AGGAAT ATGATGAAGA AAGATACi^AC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCTU^T TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGTU^CGAGA 2820 

CCTGGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTGTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAT^CCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGGTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 
ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 
CAAAAATCTG AAAATGTG7UV TAAAATTATT TTGGATTTTG TAAAAAAAAA AA7WVAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 



Seq ID NO: 319 Protein sequence: 
Protein Accession #: NP_005679 

1 11 21 31 41 51 

| | | | | | 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PVVAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIVV VIASVVTFSV 420 

HMTLGFDLTA AQAFTVVTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGVVFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQVVEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 



Seq ID NO: 320 DNA sequence 

Nucleic Acid Accession #: AK022089.1 

Coding sequence: 181-1488 

1 11 21 31 41 51 

| | | | | | 

AGCAGTTGCA CAACTTCCAG CAACTTTCTC AGCCGGCTAC TAATGAGCTG AAAGCCAGGA 60 

ACATCCGAGG AGAAGAGAAA GCTTCCAGCC CTCCTCCCTT CACCCTGGAA ATCCAGACAC 120 

CCCCACCCCC ACCCTCAGAT CACTTTAAGA TAATTTCTTT ATTCGTTTGC CCGACAGACC 180 

ATGGCTCCCT TTGGAAGAAA CTTGCTAAAG ACTCGGCATA AAAACAGATC TCCAACTAAA 240 

GACATGGATT CAGAAGAGAA GGAAATTGTG GTTTGGGTTT GCCAAGAAGA GAAGCTTGTC 300 

TGTGGGCTGA CTAAACGCAC CACCTCTGCT GATGTCATCC AGGCTTTGCT TGAGGAACAT 360 

GAGGCTACGT TTGGAGAGAA ACGATTTCTT CTGGGGAAGC CCAGTGATTA CTGCATCATA 420 

GAGAAGTGGA GAGGCTCCGA AAGGGTTCTT CCTCCACTAA CTAGAATCCT GAAGCTTTGG 480 

AAAGCGTGGG GAGATGAGCA GCCCAATATG CAATTTGTTT TGGTTAAAGC AGATGCTTTT 540 

CTTCCAGTTC CTTTGTGGCG GACAGCTGAA GCCAAATTAG TGCAAAACAC AGAAAAATTG 600 

TGGGAGCTCA GCCCAGCAAA CTACATGAAG ACTTTACCAC CAGATAAACA AAAAAGAATA 660 

GTCAGGAAAA CTTTCCGGAA ACTGGCTAAA ATTAAGCAGG ACACAGTTTC TCATGATCGA 720 

GATAATATGG AGACATTAGT TCATCTGATC ATTTCCCAGG ACCATACTAT TCATCAGCAA 780 

GTCAAGAGAA TGAAAGAGCT GGATCTGGAA ATTGAAAAGT GTGAAGCTAA GTTCCATCTT 840 

GATCGAGTAG AAAATGATGG AGAAAACTAT GTTCAGGATG CATATTTAAT GCCCAGTTTC 900 

AGTGAAGTTG AGCAAAATCT AGACTTGCAG TATGAGGAAA ACCAGACTCT GGAGGACCTG 960 

AGCGAAAGTG ATGGAATTGA ACAGCTGGAA GAACGACTGA AATATTACCG AATACTCATT 1020 

GATAAGCTCT CTGCTGAAAT AGAAAAAGAG GTAAAAAGTG TTTGCATTGA TATAAATGAA 1080 

GATGCGGAAG GGGAAGCTGC AAGTGAACTG GAAAGCTCTA ATTTAGAGAG TGTTAAGTGT 1140 

GATTTGGAGA AAAGCATGAA AGCTGGTTTG AAAATTCACT CTCATTTGAG TGGCATCCAG 1200 

AAAGAGATTA AATACAGTGA CTCATTGCTT CAGATGAAAG CAAAAGAATA TGAACTCCTG 1260 

GCCAAGGAAT TCAATTCACT TCACATTAGC AACAAAGATG GGTGCCAGTT AAAGGAAAAC 1320 

AGAGCGAAGG AATCTGAGGT TCCCAGTAGC AATGGGGAGA TTCCTCCCTT TACTCAAAGA 1380 

GTATTTAGCA ATTACACAAA TGACACAGAC TCGGACACTG GTATCAGTTC TAACCACAGT 1440 

CAGGACTCCG AAACAACAGT AGGAGATGTG GTGCTGTTGT CAACATAGTT CC7\ATGGCTC 1500 

CTTTCTGACC TGCTTTCATG TTTTAATGTT TGTTTAATTT AATAGGAAAC CTCATTTTAA 1560 

ATATAACACT CAAAAAAATG T/^TCATAT TGTAGTATTC AATAGTTAAT AAAAACTCGA 1620 
GAAATGTGTT GTTTCTG 

Seq ID NO: 321 Protein sequence: 
Protein Accession #: NP_005438.1 

1 11 21 31 41 51 

| | | | | | 

MAPFGRNLLK TRHKNRSPTK DMDSEEKEIV VWVCQEEKLV CGLTKRTTSA DVIQALLEEH 60 

EATFGEKRFL LGKPSDYCII EKWRGSERVL PPLTRILKLW KAWGDEQPNM QFVLVKADAF 120 

LPVPLWRTAE AKLVQNTEKL WELSPANYMK TLPPDKQKRI VRKTFRKLAK IKQDTVSHDR 180 

DNMETLVHLI ISQDHTIHQQ VKRMKELDLE IEKCEAKFHL DRVENDGENY VQDAYLMPSF 240 

SEVEQNLDLQ YEENQTLEDL SESDGIEQLE ERLKYYRILI DKLSAEIEKE VKSVCIDINE 300 

DAEGEAASEL ESSNLESVKC DLEKSMKAGL KIHSHLSGIQ KEIKYSDSLL QMKAKEYELL 360 

AKEFNSLHIS NKDGCQLKEN RAKESEVPSS NGEIPPFTQR VFSNYTNDTD SDTGISSNHS 420 
QDSETTVGDV VLLST 

Seq ID NO: 322 DNA sequence 

Nucleic Acid Accession #: NM_030920.1 
Coding sequence: 317-1123 

1 11 21 31 41 51 

| | | | | | 

AGCATTGAAG GGGAAGGAAC TGCGGGTGTG GTGTGTGTAT GTGTGTGTGT ATGTGTGTGC 60 

GGCGCGTGCG TGCGTGTGTG TGCGCGCGCT AGTGTGTGGA CAAGGAGGTG GGGGCAGCTG 120 

AGTTAGAGTC CCAACTCTTG GACTCCATTT GCTATTCTCT TCTTTCTCCC CCACACCTAT 180 

CTGGTGGTGG TAGTGGGCGT TTATATTTGC GTTCCTTTTC ATTCATTTCT AAATCTCTTA 240 

AAAATTTTGG GTTGGGGGTA TTGGGGAAGG CAGGAAAGGG AAAAGGAGAG TAGTAGCTGA 300 

AGAGCAAbAG GAGGACATGG AGATGAAGAA GAAGATTAAC CTGGAGTTAA GGAACAGATC 360 

CCCGGAGGAG GTGACAGAGT TAGTCCTTGA TAATTGCCTG TGTGTCAATG GGGAAATTGA 420 

AGGCCTGAAT GATACTTTCA AAGAACTAGA ATTTCTGAGT ATGGCTAATG TGGAACTAAG 480 

TTCGCTGGCC CGGCTTCCCA GCTTAAATAA ACTTCGAAAA TTGGAGCTTA GTGATAATAT 540 

AATTTCTGGA GGCTTGGAAG TCCTGGCAGA GAAATGTCCA AATCTTACCT ACCTCAATCT 600 

GAGTGGAAAC AAAATAAAAG ATCTCAGTAC AGTAGAAGCT CTGCAAAATC TTAAAAATTT 660 

GAAAAGTCTT GACCTGTTTA ACTGTGAGAT CACAAACCTG GAAGATTATA GAGAAAGTAT 720 

TTTTGAACTA CTGCAGCAAA TCACATACTT AGATGGATTT GATCAGGAGG ATAATGAAGC 780 

GCCGGACTCT GAAGAGGAGG ATGATGAGGA TGGAGATGAA GATGATGAAG AGGAAGAGGA 840 

AAATGAAGCT GGTCCACCGG AAGGATATGA GGAAGAGGAG GAGGAAGAGG AAGAGGAGGA 900 

TGAGGATGAG GATGAAGATG AAGATGAAGC AGGTTCAGAG TTGGGAGAGG GAGAAGAGGA 960 

AGTGGGCCTC TCATACTTAA TGAAAGAAGA AATTCAGGAT GAAGAAGATG ATGATGACTA 1020 

TGTTGAAGAA GGGGAAGAAG AGGAAGAAGA GGAAGAAGGA GGTCTTCGAG GGGAGAAGAG 1080 

GAAACGAGAT GCTGAAGACG ATGGAGAGGA AGAAGATGAC TAGATCATTC TAAGACCAGA 1140 

TTCTCTAATG TTTCTGGGTG TGCAATAGAG TGATCACATC TTTGTTTCTT CATGTACGAT 1200 

AGCTATCCCT ACAGAAGATA ATGTGTAACT TTTTATAGGA AAAGTGTGGT TTTACTATTT 1260 

TTGCCTTATC ATTCCAAATA AGAACTAGTC TGTTAATGAT CATATTGTAT GTAGAGAAAA 1320 

ATTTTCATTG ACTCCCATTG TGGAATTCCC TAGCAATTTA TTTAGACTTA ATTTTTTAAA 1380 

TTCAAGCTTA CTGTATTAGT CATTTTTAGC CCATAATTAA AACATGATCA CTTTTAAACA 1440 

GGTGTAGTAT GGTGCATTTC ATTCCTTATT TATAGATTAA CTGAAATTAC AGTTTGCTAT 1500 

AATATAAAAT GACAATAGTC TCTTGAGTGG TAAGTTGGTT ATTTTTTTAG AGGTGATCCA 1560 

GGAATCTTTA GTTTGAAGGC AGTTACCTTT TTTTTTTTTT TTTTTTTTTG ACTAAGAGTG 1620 

TTTGGTTGCT TTTTTGTCAC AAGTAACTTG GAAAATAGAA GCAGAATAGT AAAGGTTCTA 1680 

TTCAGCAACA TAGTTCATGG ATTTTGTGGA GGTTCTATTC AGTAATATGG TTCATGGATT 1740 

TAGTGGTGAC TGATAAGATT TTATTTTTGA AGGT^AAAATT GCTTATACTA AGTCCAGAGA 1800 

CATGCAGGTG AGCCCTTTTG TCAGGCTGCA AATCATGACA TGCCGATGGT TGTTTATTTT 1860 

GTTTTTAGGT GTGCATTCTT TTTCTTCTTA GCAATTCCTT TATGATCACC TTCCCTTCTT 1920 

GTTTCACTCC CTCCCGCTCT CTCAAAAGGA ACTTGGGAAA CTTGTGAAAC CCAGGAAAAC 1980 

CTTTAGTCTT ATACCTCAAC TACX3TTTCAG TCCTGTCTGG GTTTTAAATA AGTGAAGTAG 2040 

AAGAAATTGA GTATTTTCTG ACATAAGAAT ATATTATCAA TACAGTTTTA TGCAGTAAGC 2100 

TCTCCTTACC ATAAATGTTT CTTGGTTGAC AACATCTAAG ACAATATTAG TGGGATGAAG 2160 

AAAGTVAAAGC AGGGGTGCTT TTGGAAGCAG TGTTAGTGTT CCTCAAAAGT CGGAACAATT 2220 

GCCTGTTGAT ATATTAATAA GACATTAAAG TCAAATTTTA ATGTTGGCCT CTCAAATGAT 2280 

TTGGATACCA CTCTGCAAAG TATTTCTAAC CTTTAATTCC CAGTTTTAAA ACAGATATAA 2340 

TAATAGCATT TAATTGGAAT ATACTAGGCA GCTGGAAAAG TATTTGAAAC TAAATTGACA 2400 

TTAAAATTAA GATTTGTTTT CAAGTGGATG TCCATTAAAA GTAGAAAAAT ATTTGGGATA 2460 

AGTGAGTGTG TGTTTCCTTA CATGGCTACT AAATAAAATA TAATGAGTAT ACAAGTATAT 2520 

CTCCTCTTTT GCTATGGAGG CTCCATGTTC AAGGCAATGG CTTTTTAAAT CTTGGCTATC 2580 

TAAAATTTTT TCCCTTTGTT TTGAATATTT GTAAGTTTTT AAGAAGTTAG TGTCAGCAAA 2640 

TTAATTGAAG TTATGCTTCT ATACTGGGAC ATATTTAAAT ACTGAGTATA GTACTGCTGC 2700 

TACTGCTTCT ACAATGTAAA ATGTATGACT TGGTGTTTTA AAGTAAT^T TATGATGTTA 2760 

CTTGTGGAGA AACTAAAAAT GTTGTACAAC TGACCGAT^G AAAACCCTTG GGGATAAGTT 2820 

TAGTGAGGGG ATTGGAATCC CCAAAAAGAT AACATTTTTC TTCTGCTTTT AAAAACTGAA 2880 

ATTCCCTGTT CTAGTTCCTA ACAATTCTCA TTACATACTA TGCCAGATTA CAAAATACTT 2940 

ATTTTTAAAA TGAAATCTAT ATATTGACTT TCTTATCAAT CATCTTACTG TGCAATCAAA 3000 

ATTAGAGTAC TTTGGTTTGA AAACAACACT TAGAGCCTCC AGATAACTTT TAAGACTTAT 3060 

TTAGCTTTGT GGGTGGTATT TTCATGCAAA TAAGTAAGGG TGGGTTTTAT ATTTTGTAGA 3120 

AGTTTTCGGT CCTATTTTAA TGCTCTTTGT ATGGCAGTAT GTATATATTG TGTTAAGTTC 3180 

CTCAAGAATC TCCTTJ^/^AAA CTTTGAAGTT AATACTTTTG TGCAACTGTG TTTTGAATAA 3240 
AGCCATGACA GTGTTAAAAA CAAAC 

Seq ID NO: 323 Protein sequence: 
Protein Accession #: NP_112182.1 

1 11 21 31 41 51 

| | | | | | 

MEMKKKINLE LRNRSPEEVT ELVLDNCLCV NGEIEGLNDT FKELEFLSMA NVELSSLARL 60 

PSLNKLRKLE LSDNIISGGL EVLAEKCPNL TYLNLSGNKI KDLSTVEALQ NLKNLKSLDL 120 

FNCEITNLED YRESIFELLQ QITYLDGFDQ EDNEAPDSEE EDDEDGDEDD EEEEENEAGP 180 

PEGYEEEEEE EEEEDEDEDE DEDEAGSELG EGEEEVGLSY LMKEEIQDEE DDDDYVEEGE 240 
EEEEEEEGGL RGEKRKRDAE DDGEEEDD 

Seq ID NO: 324 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224..2722 

1 11 21 31 41 51 

| | | | | | 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 

CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 
AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 

CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 

ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GG/y^GCATCA GAGGCGTCAA 840 

AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 

CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260 

CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CX3G7U\TGTGG 1740 

AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 

GCAAGACX3GA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220 

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGC7UV GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2820 

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 

GGGGGCAA7\A GACCATGCTA TAAAAAGAAC TGTTCCAGAA GGAATCATTA TCCCTAATGG 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA AAAA 



PCT/US02/12476 






Seq ID NO: 325 Protein sequence: 
Protein Accession #: NP_003803 



1 11 21 31 41 51 

| | | | | | 

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60 

RPRAWGAAAP SAPHWNETAE KNLGVLADED NTLQQNSSSN ISYSNAMQKE ITLPSRLIYY 120 

INQDSESPYH VLDTKARHQQ KHNKAVHLAQ ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQYS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTFVYM IEPLELVHDE 240 

KSTGRPHIIQ KTLAGQYSKQ MKNLTMERGD QWPFLSELQW LKRRKRAVNP SRGIFEEMKY 300 

LELMIVNDHK TYKKHRSSHA HTNNFAKSVV NLVDSIYKEQ LNTRVVLVAV ETWTEKDQID 360 

ITTNPVQMLH EFSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420 

LPMAVAQVLS QSLAQNLGIQ WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480 

RDFLQRGGGA CLFNRPTKLF EPTECGNGYV EAGEECDCGF HVECYGLCCK KCSLSNGAHC 540 

SDGPCCNNTS CLFQPRGYEC RDAVNECDIT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600 

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDGDRWIQC SKHDVFCGFL 660 

LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHVVLD DDTDVGYVED GTPCGPSMMC 720 

LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 780 
KDEGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI 
780 



Seq ID NO: 326 DNA sequence 

Nucleic Acid Accession #: AK074418.1 

Coding sequence: 244-1515 



1 11 21 31 41 51 

| | | | | | 

CTTTCTCCAA GACGGCCGGC CATGCTCTCC TCCTCTGCCA GTCTCCTCCA CCACTCTCTA 60 

ACCTGAGAGC CTGTGGAACC TGCCCGTCTC CCCTCCTCCA TCAGACACAC CTGCCTAGGA 120 

AACAGATGGA AAAAGTGAGG GACCGGTGAG TGACTTGCTG CTAAAGTTTA TACCAGATGC 180 

AAATGACAGA GCTGGAGTTC TGCTGTGCCT GGAAAGGACC TCGGAAGTCT TCTAAGGAGA 240 

GTCATGGCGT ATTACCAGGA GCCTTCAGTG GAGACCTCCA TCATCAAGTT CAAAGACCAG 300 

GACTTTACCA CCTTGCGGGA TCACTGCCTG AGCATGGGCC GGACGTTTAA GGATGAGACA 360 

TTCCCCGCAG CAGATTCTTC CATAGGCCAG AAGCTGCTCC AGGAAAAACG CCTCTCCAAT 420 

GTGATATGGA AGCGGCCACA GGATCTACCA GGGGGTCCTC CTCACTTCAT CCTGGATGAT 480 

ATAAGCAGAT TTGACATCCA ACAAGGAGGC GCAGCTGACT GCTGGTTCCT GGCAGCACTG 540 

GGATCCTTGA CTCAGAACCC ACAGTACAGG CAGAAGATCC TGATGGTCCA AAGCTTTTCA 600 

CACCAGTATG CTGGCATTTT CCGTTTCCGG TTCTGGCAAT GTGGCCAGTG GGTGGAAGTG 660 

GTGATTGATG ACCGCCTACC TGTCCAGGGA GATAAATGCC TCTTTGTGCG TCCTCGCCAC 720 

CAAAACCAAG AGTTCTGGCC CTGCCTGCTG GAGAAGGCCT ATGCCAAGCT GCTCGGATCC 780 

TATTCCGATC TGCACTATGG CTTCCTCGAG GATGCCCTGG TGGACCTCAC AGGAGGCGTG 840 

ATCACCAACA TCCATCTGCA CTCTTCCCCT GTGGACCTGG TGAAGGCAGT GAAGACAGCG 900 

ACCAAGGCAG GCTCCCTGAT AACCTGTGCC ACTCCAAGTG GGCCTiACAGA TACAGCACAG 960 

GCGATGGAGA ATGGGCTGGT GAGTCTCCAT GCCTACACTG TGACTGGGGC TGAGCAGATT 1020 

CAATACCGAA GGGGCTGGGA AGAAATTATC TCCCTGTGGA ACCCCTGGGG CTGGGGCGAG 1080 

ACCX5AATGGA GAGGGCX3CTG GAGTGATGGG TCTCAGGAGT GGGAGGAAAC CTGTGATCCG 1140 






WO 02/086443

CGGAAAAGCC AGCTACATAA GAAACGGGAA GATGGCGAGT TTTGGATGTC GTGTCAAGAT 1200 

TTCCAACAGA AATTCATCGC CATGTTTATA TGTAGCGAAA TTCCAATTAC CCTGGACCAT 1260 

GGAAACACAC TCCACGAAGG ATGGTCCCAA ATAATGTTTA GGAAGCAAGT GATTCTAGGA 1320 

AACACTGCAG GAGGACCTCG GAATGATGCT CAATTCAACT TCTCTGTGCA AGAGCCAATG 1380

GAAGGCACCA ATGTTGTCGT GTGCGTCACA GTTGCTGTCA CACCATCAAA TTTGAAAGCA 1440

GAAGATGCAA AATTTCCACT CGATTTCCAA GTGATTCTGG CTGGCTCACA GAAACACTGT 1500 

CCAAAGCTCA AATAATAAAT TCCGCCGCAA CTTCACCATG ACTTACCATC TGAGCCCTGG 1560

GAACTATGTT GTGGTTGCAC AGACACGGAG AAAATCAGCG GAGTTCTTGC TCCGAATCTT 1620 

CCTGAAAATG CCAGACAGTG ACAGGCACCT GAGCAGCCAT TTCAACCTCA GAATGAAGGG 1680 

AAGCCCTTCA GAACATGGCT CCCAACAAAG CATTTTCAAC AGATATGCTC AGCAGGTATG 1740

GTACCTAGCA CCCAGGGGCC TTACGTGGGA TTGGAGAAAG GGGACCTGAG GGAGGGACAG 1800 

CCCTCACAGG CCCTTACTGG GATGCAGAGA GGAGAAGTGA CTTGATGGAC TATTTTACCT 1860 

GCCTCTCTTC CTGGATCGTC TCCAGAACTG CTGTGGCTGC CAAGCTCGGT AGAGACGTGG 1920 

CGCCCCACCC AGTCTCATCC GGGGGACTTC AAGCTGGAAT GCAGAGCTTA GAAAGGGAGG 1980 

GGATAATTAT GGGGTGTGAG GTGCATTGCC CTCTAAATCT TTAAACAAGC AATTGGCAGT 2040

AGCCCGTGAA ACCTTTCCTT CTCCTACTCG GCCACCTCCC ACCAACCTGG CATCGTTCCT 2100

CCCGGGAGCT AGCCAGCTTC AGAAAGCACA TACAGCATCC TTGCTGCCAA ACCACCTATG 2160 

TGCACACAGG ATTTCCTTAA TGGCTTAATA AACTGTTATA AAGAACTCCT TGACTTGTCA 2220 

GAATAAAATA GCTGCCAGGG GCTCTGCACA ATGAGCCTCT TACCGTTAAA AAAAAAAAAA 2280 

AAAAAAAT^AA AAAAAAAT^ AAAAAAAAAA



PCT/US02/12476 



Seq ID NO: 327 Protein sequence: 
Protein Accession #: BAB85075.1






1 11 21 31 41 51
| | | | | |
MAYYQEPSVE TSIIKFKDQD FTTLRDHCLS MGRTFKDETF PAADSSIGQK LLQEKRLSNV 60
IWKRPQDLPG GPPHFILDDI SRFDIQQGGA ADCWFLAALG SLTQNPQYRQ KILMVQSFSH 120
QYAGIFRFRF WQCGQWVEVV IDDRLPVQGD KCLFVRPRHQ NQEFWPCLLE KAYAKLLGSY 180
SDLHYGFLED ALVDLTGGVI TNIHLHSSPV DLVKAVKTAT KAGSLITCAT PSGPTDTAQA 240
MENGLVSLHA YTVTGAEQIQ YRRGWEEIIS LWNPWGWGET EWRGRWSDGS QEWEETCDPR 300
KSQLHKKRED GEFWMSCQDF QQKFIAMFIC SEIPITLDHG NTLHEGWSQI MFRKQVILGN 360
TAGGPKMDAQ FNFSVQEPME GTNVVVCVTV AVTPSNLKAE DAKFPLDFQV ILAGSQKHCP 420
KLK






Seq ID NO: 328 DNA sequence 

Nucleic Acid Accession #: BC017490.1

Coding sequence: 74-2788 



1 11 21 31 41 51
| | | | | |
GTGGGTCACG TGAACCACTT TTCGCGCGAA ACCTGGTTGT TGCTGTAGTG GCGGAGAGGA 60
TCGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCCA GCCCGGCCCA 120
GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 180
TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 240
CCTAGGCACA GAGGGGCCCC TGGAGGAAGA AGAGGATGGA GAGGAGCTCA TTGGAGATGG 300
CATGGAAAGG GACTACCGCG CCATCCCAGA GCTGGACGCC TATGAGGCCG AGGGACTGGC 360
TCTGGATGAT GAGGACGTAG AGGAGCTGAC GGCCAGTCAG AGGGAGGCAG CAGAGCGGGC 420
CATGCGGCAG CGTGACCGGG AGGCTGGCCG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT 480
GTATGACAGC GATGAGGAGG ACGAGGAGCG CCCTGCCCGC AAGCGCCGCC AGGTGGAGCG 540
GGCCACGGAG GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 600
TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGCG GGCCCCCGGC TGGAGATCCA 660
CCACCGCTTC AAGAACTTCC TGCGCACTCA CGTCGACAGC CACGGCCACA ACGTCTTCAA 720
GGAGCGCATC AGCGACATGT GCAAAGAGAA CCGTGAGAGC CTGGTGGTGA ACTATGAGGA 780
CTTGGCAGCC AGGGAGCACG TGCTGGCCTA CTTCCTGCCT GAGGCACCGG CGGAGCTGCT 840
GCAGATCTTT GATGAGGCTG CCCTGGAGGT GGTACTGGCC ATGTACCCCA AGTACGACCG 900
CATCACCAAC CACATCCATG TCCGCATCTC CCACCTGCCT CTGGTGGAGG AGCTGCGCTC 960
GCTGAGGCAG CTGCATCTGA ACCAGCTGAT CCGCACCAGT GGGGTGGTGA CCAGCTGCAC 1020
TGGCGTCCTG CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 1080
GGGTCCTTTC TGCCAGTCCC AGAACCAGGA GGTGAAACCA GGCTCCTGTC CTGAGTGCCA 1140
GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200
CCGAATCCAG GAGAGTCCAG GCAAAGTGGC GGCTGGCCGG CTGCCCCGCT CCAAGGACGC 1260
CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGACTGG 1320
CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCTTTGC 1380
CACTGTCATC CTAGCCAACC ACGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440
GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500
GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560
TCTGGCCCTG TTCGGAGGGG AGCCCAAAAA CCCAGGTGGC AAGCACAAGG TACGTGGTGA 1620
TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT 1680
TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGCGTCGG CTGTGGGCCT 1740
CACGGCGTAT GTCCAGCGGC ACCCTGTCAG CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800
GGTTCTGGCT GACCGAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860
CAGAACCAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCGA AGGCTGGCAT 1920
CGTCACCTCC CTGCAGGCTC GCTGCACGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCG 1980
CTACGACCCC TCGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040
CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTCCAGGACG AGATGCTGGC 2100
CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160
GGCCAATGGC AGCGCTGCTG AGCCCGCCAT GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220
CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCAAGGAG AGGGTCCACC CGAAGCTCAA 2280
CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACAGTGAC CTGAGGAAAG AATCTATGGC 2340
GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400
CCACGCGCGC ATCCATCTGC GGGACTATGT GATCGAAGAC GACGTCAACA TGGCCATCCG 2460
CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGCGCAA 2520
GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2580
GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640
CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700




CCTCTCTGCA TTTTATGACA GTGAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760

AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2820 

TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880 

TGAACTCGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2940

TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3000

TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACTCAGTACC 3060 

TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCCTTCTTAC 3120 

ATGGATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180 

TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300

GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 3360

TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 3420 
TAATTTTTAA TAAAGTTGAA TT^AAATATAA AATiAAAAAAA AAAAAA 

Seq ID NO: 329 Protein sequence: 
Protein Accession #: AAH17490.1






1 11 21 31 41 51
| | | | | |
MAESSESFTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DESEGLLGTE 60
GPLEEEEDGE ELIGDGMERD YRAIPELDAY EAEGLALDDE DVEELTASQR EAAERAMRQR 120
DREAGRGLGR MRRGLLYDSD EEDEERPARK RRQVERATED GEEDEEMIES IENLEDLKGH 180
SVREWVSMAG PRLEIHHRFK NFLRTHVDSH GHNVFKERIS DMCKENRESL VVNYEDLAAR 240
EHVLAYFLPE APAELLQIFD EAALEVVLAM YPKYDRITNH IHVRISHLPL VEELRSLRQL 300
HLNQLIRTSG VVTSCTGVLP QLSMVKYNCN KCNFVLGPFC QSQNQEVKPG SCPECQSAGP 360
FEVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKDAILLA DLVDSCKPGD EIELTGIYHN 420
NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQIGEKIFAS 480
IAPSIYGHED IKRGLALALF GGEPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540
SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600
HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDLT EPIISRFDIL 660
CVVRDTVDPV QDEMLARFVV GSHVRHHPSN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 720
KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780
HLRDYVIEDD VNMAIRVMLE SFIDTQKFSV MRSMRKTFAR YLSFRRDNNE LLLFILKQLV 840
AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNLSAF YDSELFRMNK FSHDLKRKMI 900
LQQF

Seq ID NO: 330 DNA sequence
Nucleic Acid Accession #: M17254
Coding sequence: 257-1645

1 11 21 31 41 51
| | | | | |
GTCCGCGCGT GTCCGCGCCC GCGTGTGCCA GCGCGCGTGC CTTGGCCGTG CGCGCCGAGC 60
CGGGTCGCAC TAACTCCCTC GGCGCCGACG GCGGCGCTAA CCTCTCGGTT ATTCCAGGAT 120
CTTTGGAGAC CCGAGGAAAG CCGTGTTGAC CAAAAGCAAG ACAAATGACT CACAGAGAAA 180
AAAGATGGCA GAACCAAGGG CAACTAAAGC CGTCAGGTTC TGAACAGCTG GTAGATGGGC 240
TGGCTTACTG AAGGACATGA TTCAGACTGT CCCGGACCCA GCAGCTCATA TCAAGGAAGC 300
CTTATCAGTT GTGAGTGAGG ACCAGTCGTT GTTTGAGTGT GCCTACGGAA CGCCACACCT 360
GGCTAAGACA GAGATGACCG CGTCCTCCTC CAGCGACTAT GGACAGACTT CCAAGATGAG 420
CCCACGCGTC CCTCAGCAGG ATTGGCTGTC TCAACCCCCA GCCAGGGTCA CCATCAAAAT 480
GGAATGTAAC CCTAGCCAGG TGAATGGCTC AAGGAACTCT CCTGATGAAT GCAGTGTGGC 540
CAAAGGCGGG AAGATGGTGG GCAGCCCAGA CACCGTTGGG ATGAACTACG GCAGCTACAT 600
GGAGGAGAAG CACATGCCAC CCCCAAACAT GACCACGAAC GAGCGCAGAG TTATCGTGCC 660
AGCAGATCCT ACGCTATGGA GTACAGACCA TGTGCGGCAG TGGCTGGAGT GGGCGGTGAA 720
AGAATATGGC CTTCCAGACG TCAACATCTT GTTATTCCAG AACATCGATG GGAAGGAACT 780
GTGCAAGATG ACCAAGGACG ACTTCCAGAG GCTCACCCCC AGCTACAACG CCGACATCCT 840
TCTCTCACAT CTCCACTACC TCAGAGAGAC TCCTCTTCCA CATTTGACTT CAGATGATGT 900
TGATAAAGCC TTACAAAACT CTCCACGGTT AATGCATGCT AGAAACACAG ATTTACCATA 960
TGAGCCCCCC AGGAGATCAG CCTGGACCGG TCACGGCCAC CCCACGCCCC AGTCGAAAGC 1020
TGCTCAACCA TCTCCTTCCA CAGTGCCCAA AACTGAAGAC CAGCGTCCTC AGTTAGATCC 1080
TTATCAGATT CTTGGACCTA CAAGTAGCCG CCTTGOU^T CCAGGCAGTG GCCAGATCCA 1140
GCTTTGGCAG TTCCTCCTGG AGCTCCTGTC GGACAGCTCC AACTCCAGCT GCATCACCTG 1200
GGAAGGCACC AACGGGGAGT TCAAGATGAC GGATCCCGAC GAGGTGGCCC GGCGCTGGGG 1260
AGAGCGGAAG AGCAAACCCA ACATGAACTA CGATAAGCTC AGCCGCGCCC TCCGTTACTA 1320
CTATGACAAG AACATCATGA CCAAGGTCCA TGGGAAGCGC TACGCCTACA AGTTCGACTT 1380
CCACGGGATC GCCCAGGCCC TCCAGCCCCA CCCCCCGGAG TCATCTCTGT ACAAGTACCC 1440
CTCAGACCTC CCGTACATGG GCTCCTATCA CGCCCACCCA CAGAAGATGA ACTTTGTGGC 1500
GCXXICACCCT CCAGCCCTCC CCGTGACATC TTCCAGTTTT TTTGCTGCCC CAAACCCATA 1560
CTGGAATTCA CCAACTGGGG GTATATACCC CAACACTAGG CTCCCCACCA GCCATATGCC 1620
TTCTCATCTG GGCACTTACT ACTAAAGACC TGGCGGAGGC TTTTCCCATC AGCGTGCATT 1680
CACCAGCCCA TCGCCAGAAA CTCTATCGGA GAACATGAAT CAAAAGTGCC TCAAGAGGAA 1740
TGAAAAAAGC TTTACTGGGG CTGGGGAAGG AAGCCGGGGA AGAGATCCAA AGACTCTTGG 1800
GAGGGAGTTA CTGAAGTCTT ACTACAGAAA TGAGGAGGAT GCTAAi=AATG TCACGAATAT 1860
GGACATATCA TCTGTGGACT GACCTTGTAA AAGACAGTGT ATGTAGAAGC ATGAAGTCTT 1920
AAGGACAAAG TGCCAAAGAA AGTGGTCTTA AGAAATGTAT AAACTTTAGA GTAGAGTTTG 1980
AATCCCACTA ATGCAAACTG GGATGAAACT AAAGCAATAG AAACAACACA GTTTTGACCT 2040
AACATACCGT TTATAATGCC ATTTTAAGGA AAACTACCTG TATTTAAAAA TAGTTTCATA 2100
TCAAAAACAA GAGAAAAGAC ACGAGAGAGA CTGTGGCCCA TCAACAGACG TTGATATGCA 2160
ACTGCATGGC ATGTGCTGTT TTGGTTGAAA TCAAATACAT TCCGTTTGAT GGACAGCTGT 2220
CAGCTTTCTC AAACTGTGAA GATGACCCAA AGTTTCCAAC TCCTTTACAG TATTACCGGG 2280
ACTATGAACT AAAAGGTGGG ACTGAGGATG TGTATAGAGT GAGCGTGTGA TTGTAGACAG 2340
AGGGGTGAAG AAGGAGGAGG AAGAGGCAGA GAAGGAGGAG ACCAGGCTGG GAAAGAAACT 2400
TCTCAAGCAA TGAAGACTGG ACTCAGGACA TTTGGGGACT GTGTACAATG AGTTATGGAG 2460
ACTCGAGGGT TCATGCAGTC AGTGTTATAC CAAACCCAGT GTTAGGAGAA AGGACACAGC 2520
GTAATGGAGA AAGGGAAGTA GTAGAATTCA GAAACAAAAA TGCGCATCTC TTTCTTTGTT 2580
TGTCAAATGA AAATTTTAAC TGGAATTGTC TGATATTTAA GAGAAACATT CAGGACCTCA 2640
TCATTATGTG GGGGCTTTGT TCTCCACAGG GTCAGGTAAG AGATGGCCTT CTTGGCTGCC 2700
ACAATCAGAA ATCACGCAGG CATTTTGGGT AGGCGGCCTC CAGTTTTCCT TTGAGTCGCG 2760

AACGCTGTGC GTTTGTCAGA ATGAAGTATA CAAGTCAATG TTTTTCCCCC TTTTTATATA 2820 

ATAATTATAT AACTTATGCA TTTATACACT ACGAGTTGAT CTCGGCCAGC CTU^GACACA 2880 

CGACAAAAGA GACAATCGAT ATAATGTGGC CTTGAATTTT AACTCTGTAT GCTTAATGTT 2940

TACAATATGA AGTTATTAGT TCTTAGAATG CAGAATGTAT GTAATAAAAT AAGCTTGGCC 3000 

TAGCATGGCA AATCAGATTT ATACAGGAGT CTGCATTTGC ACTTTTTTTA GTGACTAAAG 3060 

TTGCTTAATG AAAACATGTG CTGAATGTTG TGGATTTTGT GTTATAATTT ACTTTGTCCA 3120 
GGAACTTGTG CAAGGGAGAG CCAAGGAAAT AGGATGTTTG GCACCC 

Seq ID NO: 331 Protein sequence 
Protein Accession #: AAA52398

1 11 21 31 41 51 

| | | | | |

MIQTVPDPAA HIKEALSVVS EDQSLFECAY GTPHLAKTEM TASSSSDYGQ TSKMSPRVPQ 60

QDWLSQPPAR VTIKMECNPS QVNGSRNSPD ECSVAKGGKM VGSPDTVGMN YGSYMEEKHM 120 

PPPNMTTNER RVIVPADPTL WSTDHVRQWL EWAVKEYGLP DVNILLFQNI DGKELCKMTK 180 

DDFQRLTPSY NADILLSHLH YLRETPLPHL TSDDVDKALQ NSPRLMHARN TDLPYEPPRR 240 

SAWTGHGHPT PQSKAAQPSP STVPKTEDQR PQLDPYQILG PTSSRLANPG SGQIQLWQFL 300

LELLSDSSNS SCITWEGTNG EFKMTDPDEV ARRWGERKSK PNMNYDKLSR ALRYYYDKNI 360

MTKVHGKRYA YKFDFHGIAQ ALQPHPPESS LYKYPSDLPY MGSYHAHPQK MNFVAPHPPA 420 

LPVTSSSFFA APNPYWNSPT GGIYPNTRLP TSHMPSHLGT YY 462

Seq ID NO: 332 DNA sequence 
Nucleic Acid Accession #: NM_000020 
Coding sequence: 283-1794 

1 11 21 31 41 51

| | | | | |

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CCATGACCTT GGGCTCCCCC 300 

AGGAAAGGCC TTCTGATGCT GCTGATGGCC TTGGTGACCC AGGGAGACCC TGTGAAGCCG 360 

TCTCGGGGCC CGCTGGTGAC CTGCACGTGT GAGAGCCCAC ATTGCAAGGG GCCTACCTGC 420 

CGGGGGGCCT GGTGCACAGT AGTGCTGGTG CGGGAGGAGG GGAGGCACCC CCAGGAACAT 480 

CGGGGCTGCG GGAACTTGCA CAGGGAGCTC TGCAGGGGGC GCCCCACCGA GTTCGTCAAC 540

CACTACTGCT GCGACAGCCA CCTCTGCAAC CACAACGTGT CCCTGGTGCT GGAGGCCACC 600 

CAACCTCCTT CGGAGCAGCC GGGAACAGAT GGCCAGCTGG CCCTGATCCT GGGCCCCGTG 660 

CTGGCCTTGC TGGCCCTGGT GGCCCTGGGT GTCCTGGGCC TGTGGCATGT CCGACGGAGG 720

CAGGAGAAGC AGCGTGGCCT GCACAGCGAG CTGGGAGAGT CCAGTCTCAT CCTGAAAGCA 780

TCTGAGCAGG GCGACACGAT GTTGGGGGAC CTCCTGGACA GTGACTGCAC CACAGGGAGT 840

GGCTCAGGGC TCCCCTTCCT GGTGCAGAGG ACAGTGGCAC GGCAGGTTGC CTTGGTGGAG 900 

TGTGTGGGTA AAGGCCGCTA TGGCGAAGTG TGGCGGGGCT TGTGGCACGG TGAGAGTGTG 960

GCCGTCAAGA TCTTCTCCTC GAGGGATGAA CAGTCCTGGT TCCGGGAGAC TGAGATCTAT 1020

AACACAGTAT TGCTCAGACA CGACAACATC CTAGGCTTCA TCGCCTCAGA CATGACCTCC 1080 

CGCAACTCGA GCACGCAGCT GTGGCTCATC ACGCACTACC ACGAGCACGG CTCCCTCTAC 1140 

GACTTTCTGC AGAGACAGAC GCTGGAGCCC CATCTGGCTC TGAGGCTAGC TGTGTCCGCG 1200 

GCATGCGGCC TGGCGCACCT GCACGTGGAG ATCTTCGGTA CACAGGGCAA ACCAGCCATT 1260

GCCCACCGCG ACTTCAAGAG CCGCAATGTG CTGGTCAAGA GCAACCTGCA GTGTTGCATC 1320

GCCGACCTGG GCCTGGCTGT GATGCACTCA CAGGGCAGCG ATTACCTGGA CATCGGCAAC 1380 

AACCCGAGAG TGGGCACCAA GCGGTACATG GCACCCGAGG TGCTGGACGA GCAGATCCGC 1440 

ACGGACTGCT TTGAGTCCTA CAAGTGGACT GACATCTGGG CCTTTGGCCT GGTGCTGTGG 1500

GAGATTGCCC GCCGGACCAT CGTGAATGGC ATCGTGGAGG ACTATAGACC ACCCTTCTAT 1560 

GATGTGGTGC CCAATGACCC CAGCTTTGAG GACATGAAGA AGGTGGTGTG TGTGGATCAG 1620 

CAGACCCCCA CCATCCCTAA CCGGCTGGCT GCAGAGCGGG TCCTCTCAGG CCTAGCTCAG 1680 

ATGATGCGGG AGTGCTGGTA CCCAAACCCC TCTGCCCGAC TCACCGCGCT GCGGATCAAG 1740

AAGACACTAC AAAAAATTAG CAACAGTCCA GAGAAGCCTA AAGTGATTCA ATAGCCCAGG 1800 

AGCACCTGAT TCCTTTCTGC CTGCAGGGGG CTGGGGGGGT GGGGGGCAGT GGATGGTGCC 1860 

CTATCTGGGT AGAGGTAGTG TGAGTGTGGT GTGTGCTGGG GATGGGCAGC TGCGCCTGCC 1920 

TGCTCGGCCC CCAGCCCACC CAGCCAAAAA TACAGCTGGG CTGAAACCTG ATCCCCTGCT 1980 

GTCTGGCCTG CTCAAAGCGG CAGGCTCCCT GACGCCTGGC TCTCTCCCCA CCCCTATGGC 2040 

CAGCATGGTG CACCCCCTAC CACTCCCGGG ACAGGATGCA AT^GAGGCTC CAGAGTCAGA 2100 

GTGCCAAGCC AGGGAATCCC AGTCCCAGAC TCAGAGCCCG GGCCTGCACT TTGCCCCCTG 2160 

CCCTTGATCA ACCCCACTGC CCCACCAGAG CTGCCAGGGT GGCACAGGGC CCTGTCCAGC 2220

CCCTGGCACA CACTTCCCTG CCAGGCCTCA GCCTCTAGCA TAAGCTCCAG AGAGCCAGGG 2280 

CCCATCAGTT TCTCTCTGTG GATTTGTATC TCAGCTCCAT GATGCCTTGG GCTTTCTGTC 2340 

TCCTCAACAA GAGTGCAGCT TGCTGAATGT CAGCTGCCTG AGAGAGCTGG GGCCTGACTT 2400 

ACTAGGGCAT TAAATCCTAA GAGGTCCTAC TGAGGTGTGG CAGGATCACA GGCCAGTGGA 2460 

AAAAGGGCAG GTCAGATGGG CAAGGCCCAG GACTTTCAGA TTAACTGAGA GGATATCGAG 2520

GCCAAGCATG GCAGGGGGAA GGTCAGTGGG TGTCAAGAGA CCCAGGTCTG ACCCCGGATG 2580 

TTTGCTCCAT GTGACAAAAG CAGGCCTGTC TCAGGACCTT TTCTTTTCTT TTTTCCTTCT 2640 

TTTTTTTTTT GACACGGAGT TTCGCTCTTG TTGTCCAGGC TAGAGTGCAA TGGCATGATC 2700 

CCAGCTCACC GCAACGTCTA CCTCCCAGGT TCAAATCATT CTCTTGCCTC AGACTCCCGA 2760 

GTAGCTGGGA TTACAGGCAC ATGCCACCAT GCCTGGCTAA TTTTGTATAT TTAGTAGAAA 2820 

CAGGGTTTCA CCATGCTGGC CATGCTGGTT CTCGAACTCC TGACCTCAGG TGTTCCACCT 2880 

ACCTCAGCCT CCCAAAGTGC TGGGGTTACA GGTGTGAGCC ATCGCGCCTG GCCAGGACCT 2940

TTGTTTCTTA TCTACATATT GGAAGATTTG GTCCTGATGT CCTTTGAGGC TTCTTTAGCT 3000 

CTAGTTCTCT GACACTTCAG CCTATATCAC AGCTAACTTC YTCAGTCTCA TCTATTCCTT 3060 

ATGCTCCAGC CCCTGGCAAT TTGCCTCAAG ATGGGGGTTT GAAAATAACT TTACCTGACT 3120 

CAAGGAGTGT CTGGAGCACC TCCTAGTCTA AGTCTGCAAG CTCCAGTTCT TGCCTAAAAC 3180 

CATGCCAGTG GCCACCCTTG GGCTCAGACA GCTCTGGGCC TTTTGACCAC AAGCCAGCCC 3240 

CTCGCCCTCT CTGTGGCATA GTCTTCTCTG CCCCAGGACT GCAGGGCGGC TTCCTCCAAG 3300 

GCTTCCAAGG CTCAAAAGAA ATTTGGCTCC ATCCAAGAAG GCTCCAGCTC CCCTACTGGC 3360 

CCCTGGCTTC AGGCCCACAC CCCTGGGCCA GGSCCAGAGA GTGTGTCTCA GGAGAATTCA 3420

ATGGGCTCTA GAGAGACACA CAGAAAGTTT GGGCATTTGG GAAATTTTCA AGGRTGTATG 3480

TATGGYTCAC GTATGGWGCA GGTTGTCCTG GTCCYKGGGT GCT^GGGAAGT GGGCTGCAGG 3540

GAAGTGGATT GGAGGGGAGC TTGAGGAATA TAAGGAGCGG GGGTGGAGAC TCAGGCTATG 3600

GACAAGGACA GCCCCAAGGT TGGGAAGACC TGGCCTTAGT CGTCCTCAGC CTAGGGCAGG 3660 

GCAGTGAAGA AAGCTCTCCC CGCTCCTGCT GTAATGACCC AGAGTAGCCT CCCCAGGCCG 3720 

GCATCTTATG TGTGTCTTCC ACCATCCTCA TGGTGGCACT TTTCTAGGCC TGTCTCCCAG 3780 

CATTGTGCAA GGCTCGGAAG AGAACCAGGA AGTGAAACTG GGTGAAAACA GAAAGCTCAA 3840

TGGATGGGCT AGGTTCCCAG ATCATTAGGG CAGAGTTTGC ACGTCCTCTG GTTCACTGGG 3900 

AATCCACCCA GCCCACGAAT CATCTCCCTC TTTGAAGGAT TTTWATTTCT ACTGGGTTTT 3960 

GGAACAAACT CCTGCTGAGA CCCCACAGCC AGAAACTGAA AGCAGCAGCT CCCCAAAGCC 4020

TGGAAAATCC CTAAGAGAAG GCCTGGGGGA MAGGAAKTGG AGTGACAGGG GACAGGTAGA 4080

GAGAAGGGGG CCCAATGGCC AGGGAGTGAA GGAGGTGGCG TTGCTGAGAG CAGTCTGCAC 4140 

ATGCTTCTGT CTGAGTGCAG GAAGGTGTTC CAGGGTCGAA ATTACACTTC TCGTACCTGG 4200 

AGACGCTGTT TGTGGGAGCA CTGGGCTCAT GCCTGGCACA CAATAGGTCT GCAATAAACC 4260 
ATGGTTAAAT CCTGAAAAAA AAAAAAAAA 

Seq ID NO: 333 Protein sequence 
Protein Accession #: NP_000011 



1 11 21 31 41 51 

| | | | | |

MTLGSPRKGL LMLLMALVTQ GDPVKPSRGP LVTCTCESPH CKGPTCRGAW CTVVLVREEG 60

RHPQEHRGCG NLHRELCRGR PTEFVNHYCC DSHLCNHNVS LVLEATQPPS EQPGTDGQLA 120

LILGPVLALL ALVALGVLGL WHVRRRQEKQ RGLHSELGES SLILKASEQG DTMLGDLLDS 180 

DCTTGSGSGL PFLVQRTVAR QVALVECVGK GRYGEVWRGL WHGESVAVKI FSSRDEQSWF 240 

RETEIYNTVL LRHDNILGFI ASDMTSRNSS TQLWLITHYH EHGSLYDFLQ RQTLEPHLAL 300

RLAVSAACGL AHLHVEIFGT QGKPAIAHRD FKSRNVLVKS NLQCCIADLG LAVMHSQGSD 360

YLDIGNNPRV GTKRYMAPEV LDEQIRTDCF ESYKWTDIWA FGLVLWEIAR RTIVNGIVED 420

YRPPFYDVVP NDPSFEDMKK VVCVDQQTPT IPNRLAADPV LSGLAQMMRE CWYPNPSARL 480
TALRIKKTLQ KISNSPEKPK VIQ

Seq ID NO: 334 DNA sequence 

Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 108-329 

1 11 21 31 41 51

| | | | | |

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 
AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120
ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 
AAGTGAAGTT GCAGAGACAA GAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 
AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 
AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 
AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 
TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 
GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 
ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT

Seq ID NO: 335 Protein sequence 
Protein Accession #: NP_004117.1

1 11 21 31 41 51 

| | | | | |

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 60
KNPFKEKGSC VIS 



Seq ID NO: 336 DNA sequence 
Nucleic Acid Accession #: NM_005795
Coding sequence: 555-1940 

1 11 21 31 41 51 

| | | | | |

GCACGAGGGA ACAACCTCTC TCTCTSCAGC AGAGAGTGTC ACCTCCTGCT TTAGGACCAT 60 

CAAGCTCTGC TAACTGAATC TCATCCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 240 

AAGAAATTCT TAAAGACAAT GTCAAATATG ATCCAAGAGA AAATGTGATT TGAGTCTGGA 300 

GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 360 

GAATAATAAA AACCCATACT AGCCTATAGA AAACAATATT TGAAAGATTG CTACCACTAA 420

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT TATGTTATAC AGCATATTTC 540 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 660

TTACTAGAAA TAAAATCATG ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAATCAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900

CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 1020

TGCTTATCTC GCTTGGCATA TTCTTTTATT TCAAGAGCCT AAGTTGCCAA AGGATTACCT 1080 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTGGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAGGCATTT 1260 

ACCTACACAC ACTCATTGTG GTGGCCGTGT TTGCAGAGAA GCAACATTTA ATGTGGTATT 1320 

ATTTTCTTGG CTGGGGATTT CCACTGATTC CTGCTTGTAT ACATGCCATT GCTAGAAGCT 1380

TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTACGCGTTC 1500

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTGTACATG AAAGCTGTGA 1560 

GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

CTGAAGGAAA GATTGCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920

CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGATJJVTCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT GATGTGACGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTAXGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATATCCATTG TGGAAACTGG ATGAACAGGA TGTATAATAT GCAATCTTAC 2640 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG CCCTTCCATT 2760 

TCTACTGTAT AAACAAATTA GCAATCATTT TATATAAAGA AAATCAATGA AGGATTTCTT 2820 

ATTTTCTTGG AATTTTGTAA AAAGAAATTG TGAAAAATGA GCTTGTAAAT ACTCCATTAT 2880 

TTTATTTTAT AGTCTCAAAT CAAATACATA CAACCTATGT AATTTTTAAA GCAAATATAT 2940 

AATGC2\ACAA TGTGTGTATG TTAATATCTG ATACTGTATC TGGGCTGATT TTTTAAATAA 3000 
AATAGAGTCT GGAATGCT 

Seq ID NO: 337 Protein sequence
Protein Accession #: NP_005786.1

1 11 21 31 41 51 

| | | | | |

MEKKCTLYFL VLLPFFMILV TAELEESPED SIQLGVTRNK IMTAQYECYQ KIMQDPIQQA 60

EGVYCNRTWD GWLCWNDVAA GTESMQLCPD YFQDFDPSEK VTKICDQDGN WFRHPASNRT 120

WTNYTQCNVN THEKVKTALN LFYLTIIGHG LSIASLLISL GIFFYFKSLS CQRITLHKNL 180 

FFSFVCNSVV TIIHLTAVAN NQALVATNPV SCKVSQFIHL YLMGCNYFWM LCEGIYLHTL 240

IVVAVFAEKQ HLMWYYFLGW GFPLIPACIH AIARSLYYND NCWISSDTHL LYIIHGPICA 300

ALLVNLFFLL NIVRVLITKL KVTHQAESNL YMKAVRATLI LVPLLGIEFV LIPWRPEGKI 360

AEEVYDYIMH ILMHFQGLLV STIFCFFNGE VQAILRRNWN QYKIQFGNSF SNSEALRSAS 420
YTVSTISDGP GYSHDCPSEH LNGKSIHDIE NVLLKPENLY N

Seq ID NO: 338 DNA sequence
Nucleic Acid Accession #: NM_001795 
Coding sequence: 25-2379 

1 11 21 31 41 51 

| | | | | |

GCACGATCTG TTCCTCCTGG GAAGATGCAG AGGCTCATGA TGCTCCTCGC CACATCGGGC 60 

GCCTGCCTGG GCCTGCTGGC AGTGGCAGCA GTGGCAGCAG CAGGTGCTAA CCCTGCCCAA 120 

CGGGACACCC ACAGCCTGCT GCCCACCCAC CGGCGCCAAA AGAGAGATTG GATTTGGAAC 180 

CAGATGCACA TTGATGAAGA GAAAAACACC TCACTTCCCC ATCATGTAGG CAAGATCAAG 240 

TCAAGCGTGA GTCGCAAGAA TGCCAAGTAC CTGCTCAAAG GAGAATATGT GGGCAAGGTC 300

TTCCGGGTCG ATGCAGAGAC AGGAGACGTG TTCGCCATTG AGAGGCTGGA CCGGGAGAAT 360 

ATCTCAGAGT ACCACCTCAC TGCTGTCATT GTGGACAAGG ACACTGGTGA AAACCTGGAG 420 

ACTCCTTCCA GCTTCACCAT CAAAGTTCAT GACGTGAACG ACAACTGGCC TGTGTTCACG 480 

CATCGGTTGT TCAATGCGTC CGTGCCTGAG TCGTCGGCTG TGGGGACCTC AGTCATCTCT 540 

GTGACAGCAG TGGATGCAGA CGACCCCACT GTGGGAGACC ACGCCTCTGT CATGTACCAA 600 

ATCCTGAAGG GGAAAGAGTA TTTTGCCATC GATAATTCTG GACGTATTAT CACAATAACG 660 

AAAAGCTTGG ACCGAGAGAA GCAGGCCAGG TATGAGATCG TGGTGGAAGC GCGAGATGCC 720 

CAGGGCCTCC GGGGGGACTC GGGCACGGCC ACCGTGCTGG TCACTCTGCA AGACATCAAT 780 

GACAACTTCC CCTTCTTCAC CCAGACCAAG TACACATTTG TCGTGCCTGA AGACACCCGT 840 

GTGGGCACCT CTGTGGGCTC TCTGTTTGTT GAGGACCCAG ATGAGCCCCA GAACCGGATG 900 

ACCAAGTACA GCATCTTGCG GGGCGACTAC CAGGACGCTT TCACCATTGA GACAAACCCC 960 

GCCCACAACG AGGGCATCAT CAAGCCCATG AAGCCTCTGG ATTATGAATA CATCCAGCAA 1020 

TACAGCTTCA TCGTCGAGGC CACAGACCCC ACCATCGACC TCCGATACAT GAGCCCTCCC 1080 

GCGGGTAACA GAGCCCAGGT CATTATCAAC ATCACAGATG TGGACGAGCC CCCCATTTTC 1140

CAGCAGCCTT TCTACCACTT CCAGCTGAAG GAAAACCAGA AGAAGCCTCT GATTGGCACA 1200

GTGCTGGCCA TGGACCCTGA TGCGGCTAGG CATAGCATTG GATACTCCAT CCGCAGGACC 1260

AGTGACAAGG GCCAGTTCTT GCGAGTCACA AAAAAGGGGG ACATTTACAA TGAGAAAGAA 1320 

CTGGACAGAG AAGTCTACCC CTGGTATAAC CTGACTGTGG AGGCCAAAGA ACTGGATTCC 1380 

ACTGGAACCC CCACAGGAAA AGAATCCATT GTGCAAGTCC ACATTGAAGT TTTGGATGAG 1440 

AATGACAATG CCCCGGAGTT TGCCAAGCCC TACCAGCCCA AAGTGTGTGA GAACGCTGTC 1500 

CATGGCCAGC TGGTCCTGCA GATCTCCGCA ATAGACAAGG ACATAACACC ACGAAACGTG 1560 

AAGTTCAAAT TCACCTTGAA TACTGAGAAC AACTTTACCC TCACGGATAA TCACGATAAC 1620

ACGGCCAACA TCACAGTCAA GTATGGGCAG TTTGACCGGG AGCATACCAA GGTCCACTTC 1680 

CTACCCGTGG TCATCTCAGA CAATGGGATG CCAAGTCGCA CGGGCACCAG CACGCTGACC 1740 

GTGGCCGTGT GCAAGTGCAA CGAGCAGGGC GAGTTCACCT TCTGCGAGGA TATGGCCGCC 1800

CAGGTGGGCG TGAGCATCCA GGCAGTGGTA GCCATCTTAC TCTGCATCCT CACCATCACA 1860 

GTGATCACCC TGCTCATCTT CCTGCGGCGG CGGCTCCGGA AGCAGGCCCG CGCGCACGGC 1920

AAGAGCGTGC CGGAGATCCA CGAGCAGCTG GTCACCTACG ACGAGGAGGG CGGCGGCGAG 1980 

ATGGACACCA CCAGCTACGA TGTGTCGGTG CTCAACTCGG TGCGCCGCGG CGGGGCCAAG 2040 

CCCCCGCGGC CCGCGCTGGA CGCCCGGCCT TCCCTCTATG CGCAGGTGCA GAAGCCACCG 2100 

AGGCACGCGC CTGGGGCACA CGGAGGGCCC GGGGAGATGG CAGCCATGAT CGAGGTGAAG 2160

AAGGACGAGG CGGACCACGA CGGCGACGGC CCCCCCTACG ACACGCTGCA CATCTACGGC 2220

TACGAGGGCT CCGAGTCCAT AGCCGAGTCC CTCAGCTCCC TGGGCACCGA CTCATCCGAC 2280

TCTGACGTGG ATTACGACTT CCTTAACGAC TGGGGACCCA GGTTTAAGAT GCTGGCTGAG 2340

CTGTACGGCT CGGACCCCCG GGAGGAGCTG CTGTATTAGG CGGCCGAGGT CACTCTGGGC 2400 

CTGGGGACCC AAACCCCCTG CAGCCCAGGC CAGTCAGACT CCAGGCACCA CAGCCTCCTW^ 2460 

AAATGGCAGT GACTCCCCAG CCCAGCACCC CTTCCTCGTG GGTCCCAGAG ACCTCATCAG 2520

CCTTGGGATA GCAAACTCCA GGTTCCTGAA ATATCCAGGA ATATATGTCA GTGATGACTA 2580 

TTCTCAAATG CTGGCAAATC CAGGCTGGTG TTCTGTCTGG GCTCAGACAT CCACATAACC 2640 

CTGTCACCCA CAGACCGCCG TCTAACTCAA AGACTTCCTC TGGCTCCCCA AGGCTGCAAA 2700 

GCAAT^CAGA CTGTGTTTAA CTGCTGCAGG GTCTTTTTCT AGGGTCCCTG AACGCCCTGG 2760 

TAAGGCTGGT GAGGTCCTGG TGCCTATCTG CCTGGAGGCA AAGGCCTGGA CAGCTTGACT 2820

TGTGGGGCAG GATTCTCTGC AGCCCATTCC CAAGGGAGAC TGACCATCAT GCCCTCTCTC 2880

GGGAGCCCTA GCCCTGCTCC AACTCCATAC TCCACTCCAA GTGCCCCACC ACTCCCCAAC 2940

CCCTCTCCAG GCCTGTCAAG AGGGAGGAAG GGGCCCCATG GCAGCTCCTG ACCTTGGGTC 3000

CTGAAGTGAC CTCACTGGCC TGCCATGCCA GTAACTGTGC TGTACTGAGC ACTGAACCAC 3060 

ATTCAGGGAA ATGCTTATTA AACCTTGAAG CAACTGTGAA TTCATTCTGG AGGGGCAGTG 3120 

GAGATCAGGA GTGACAGATC ACAGGGTGAG GGCCACCTCC ACACCCACCC CCTCTGGAGA 3180 

AGGCCTGGAA GAGCTGAGAC CTTGCTTTGA GACTCCTCAG CACCCCTCCA GTTTTGCCTG 3240 

AGAAGGGGCA GATGTTCCCG GAGATCAGAA GACGTCTCCC CTTCTCTGCC TCACCTGGTC 3300

GCCAATCCAT GCTCTCTTTC TTTTCTCTGT CTACTCCTTA TCCCTTGGTT TAGAGGAACC 3360 

CAAGATGTGG CCTTTAGCAA AACTGACAAT GTCCAAACCC ACTCATGACT GCATGACGGA 3420 

GCCGAGCATG TGTCTTTACA CCTCGCTGTT GTCACATCTC AGGGAACTGA CCCTCAGGCA 3480 

CACCTTGCAG AAGGAAGGCC CTGCCCTGCC CAACCTCTGT GGTCACCCAT GCATCATTCC 3540 

ACTGGAACGT TTCACTGCAA ACACACCTTG GAGAAGTGGC ATCAGTCAAC AGAGAGGGGC 3600 

AGGGAAGGAG ACACCAAGCT CACCCTTCGT CATGGACCGA GGTTCCCACT CTGGCAAAGC 3660 

CCCTCACACT GCAAGGGATT GTAGATAACA CTGACTTGTT TGTTTTAACC AATAACTAGC 3720

TTCTTATAAT GATTTTTTTA CTAATGATAC TTACAAGTTT CTAGCTCTCA CAGACATATA 3780 

GAATAAGGGT TTTTGCATAA Ti^AGCAGGTT GTTATTTAGG TTAACAATAT TAATTCAGGT 3840 

TTTTTAGTTG GAAA/UVCAAT TCCTGTAACC TTCTATTTTC TATAATTGTA GTAATTGCTC 3900 

TACAGATAAT GTCTATATAT TGGCCAAACT GGTGCATGAC AAGTACTGTA TTTTTTTATA 3960
CCTAAATAAA GAAAAATCTT TAGCCTGGGC AACAAT^AAAA 

Seq ID NO: 339 Protein sequence
Protein Accession #: NP_001786 

1 11 21 31 41 51 

| | | | | |

MQRLMMLLAT SGACLGLLAV AAVAAAGANP AQRDTHSLLP THRRQKRDWI WNQMHIDEEK 60

NTSLPHHVGK IKSSVSRKNA KYLLKGEYVG KVFRVDAETG DVFAIERLDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV ISVTAVDADD 180

PTVGDHASVM YQILKGKEYF AIDNSGRIIT ITKSLDREKQ ARYEIVVEAR DAQGLRGDSG 240

TATVLVTLQD INDNFPFFTQ TKYTFVVPED TRVGTSVGSL FVEDPDEPQN RMTKYSILRG 300

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVEAT DPTIDLRYMS PPAGNRAQVI 360

INITDVDEPP IFQQPFYHFQ LKENQKKPLI GTVLAMDPDA ARHSIGYSIR RTSDKGQPFR 420

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIEVL DENDNAPEFA 480

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFLPVVISDN GMPSRTGTST LTVAVCKCNE QGEFTFCEDM AAQVGVSIQA 600

VVAILLCILT ITVITLLIFL RRRLRKQARA HGKSVPEIHE QLVTYDEEGG GEMDTTSYDV 660

SVLNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIE VKKDEADHDG 720 

DGPPYDTLHI YGYEGSESIA ESLSSLGTDS SDSDVDYDFL NDWGPRFKML AELYGSDPRE 780
ELLY 



Seq ID NO: 340 DNA sequence 
Nucleic Acid Accession #: NM_003088
Coding sequence: 112-1593 

1 11 21 31 41 51 

| | | | | |

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60

CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 120

AACGGCACAG CCGAGGCGGT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 240 

CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACGGCA ACGTGACCTG CGAGCGCGAG 360

GTGCCCGGTC CCGACTGCCG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 

CAGTCCGAGG CGCACCGGCG CTACTTCGGC GGCACCGAGG ACCGCCTGTC CTGCTTCGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540

ATCTACAGTG TCACCCGTAA GCGCTACGCG CACCTGAGCG CGCGGCCGGC CGACGAGATC 600

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 

GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780

CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCGGGC 840 

AAGGCCACCA AGGTGGGCAA GGAGGAGCTG TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 90 0 

GTGCTGCAGG CGGCCAACGA GAGGAACGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG CGACACCAAA 1020 

AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 

CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 1140 

CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 1200 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 1380 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAGGGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATGGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 

TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA 1860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTG 1920 

GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCTU^GC 2040 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2220 

ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 2400 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 

GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 

TCCCCTW^CAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2760 
AGTCTGC 

Seq ID NO: 341 Protein sequence 
Protein Accession #: NP_003079 

1 11 21 31 41 51 

I I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTAEAFG FKVNASASSL KKKQIWTLEQ PPDEAGSAAV 60 
CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YFGGTEDRLS 120 
CFAQTVSPAE KWSVHIAMHP QVNIYSVTRK RYAHLSARPA DEIAVDRDVP WGVDSLITLA 180 
FQDQRYSVQT ADHRFLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTL 240 
KAGKATKVGK DELFALEQSC AQVVLQAANE RNVSTRQGMD LSANQDEETD QETFQLEIDR 300 
DTKKCAFRTH TGKYWTLTAT GGVQSTASSK NASCYFDIEW RDRRITLRAS NGKFVTSKKN 360 
GQLAASVETA GDSELFLMKL INRPIIVFRG EHGFIGCRKV TGTLDANRSS YDVFQLEFND 420 
GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FFFEFCDYNK VAIKVGGRYL KGDHAGVLKA 480 
SAETVDPASL WEY 

Seq ID NO: 342 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 660-1705 

1 11 21 31 41 51 

I I I I I I 

CGCTCCGCAC ACATTTCCTG TCGCGGCCTA AGGGAAACTG TTGGCCGCTG GGCCCGCGGG 60 

GGGATTCTTG GCAGTTGGGG GGTCCGTCGG GAGCGAGGGC GGAGGGGAAG GGAGGGGGAA 120 

CCGGGTTGGG GAAGCCAGCT GTAGAGGGCG GTGACCGCGC TCCAGACACA GCTCTGCGTC 180 

CTCGAGCGGG ACAGATCCAA GTTGGGAGCA GCTCTGCGTG CGGGGCCTCA GAGAATGAGG 240 

CCGGCGTTCG CCCTGTGCCT CCTCTGGCAG GCGCTCTGGC CCGGGCCGGG CGGCGGCGAA 300 

CACCCCACTG CCGACCGTGC TGGCTGCTCG GCCTCGGGGG CCTGCTACAG CCTGCACCAC 360 

GCTACCATGA AGCGGCAGGC GGCCGAGGAG GCCTGCATCC TGCGAGGTGG GGCGCTCAGC 420 

ACCGTGCGTG CGGGCGCCGA GCTGCGCGCT GTGCTCGCGC TCCTGCGGGC AGGCCCAGGG 480 

CCCGGAGGGG GCTCCAAAGA CCTGCTGTTC TGGGTCGCAC TGGAGCGCAG GCGTTCCCAC 540 

TGCACCCTGG AGAACGAGCC TTTGCGGGGT TTCTCCTGGC TGTCCTCCGA CCCCGGCGGT 600 

CTCGAAAGCG ACACGCTGCA GTGGGTGGAG GAGCCCCAAC GCTCCTGCAC CGCGCGGAGA 660 

TGCGCGGTAC TCCAGGCCAC CGGTGGGGTC GAGCCCGCAG CTGGAAGGAG ATGCGATGCC 720 

ACCTGCGCGC CAACGGCTAC CTGTGCAAGT ACCAGTTTGA GGTCTTGTGT CCTGCGCCGC 780 

GCCCCGGGGC CGCCTCTAAC TTGAGCTATC GCGCGCCCTT CCAGCTGCAC AGCGCCGCTC 840 

TGGACTTCAG TCCACCTGGG ACCGAGGTGA GTGCGCTCTG CCGGGGACAG CTCCCGATCT 900 

CAGTTACTTG CATCGCGGAC GAAATCGGCG CTCGCTGGGA CAAACTCTCG GGCGATGTGT 960 

TGTGTCCCTG CCCCGGGAGG TACCTCCGTG CTGGCAAATG CGCAGAGCTC CCTAACTGCC 1020 

TAGACGACTT GGGAGGCTTT GCCTGCGAAT GTGCTACGGG CTTCGAGCTG GGGAAGGACG 1080 

GCCGCTCTTG TGTGACCAGT GGGGAAGGAC AGCCGACCCT TGGGGGGACC GGGGTGCCCA 1140 

CCAGGCGCCC GCCGGCCACT GCTVACCAGCC CCGTGCCGCA GAGAACATGG CCAATCAGGG 1200 

TCGACGAGAA GCTGGGAGAG ACACCACTTG TCCCTGAACA AGACAATTCA GTAACATCTA 1260 

TTCCTGAGAT TCCTCGATGG GGATCACAGA GCACGATGTC TACCCTTCAA ATGTCCCTTC 1320 

AAGCCGAGTC AAAGGCCACT ATCACCCCAT CAGGGAGCGT GATTTCCAAG TTTAATTCTA 1380 

CGACTTCCTC TGCCACTCCT CAGGCTTTCG ACTCCTCCTC TGCCGTGGTC TTCATATTTG 1440 

TGAGCACAGC AGTAGTAGTG TTGGTGATCT TGACCATGAC AGTACTGGGG CTTGTCAAGC 1500 

TCTGCTTTCA CGAAAGCCCC TCTTCCCAGC CAAGGAAGGA GTCTATGGGC CCGCCGGGCC 1560 

TGGAGAGTGA TCCTGAGCCC GCTGCTTTGG GCTCCAGTTC TGCACATTGC ACAAACAATG 1620 

GGGTGAAAGT CGGGGACTGT GATCTGCGGG ACAGAGCAGA GGGTGCCTTG CTGGCGGAGT 1680 
CCCCTCTTGG CTCTAGTGAT GCATAG 

Seq ID NO: 343 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MGKDFMTKTP KAFATKAKID KWDLIKLKSF CTAKETIIRV NSQPTDWQKT FAIYPSDKGV 60 

IARIYKELEQ IYKKKKPTKT LRTHFLSRPK GNCWPLGPRG DSWQLGGPSG ARAEGKGGGT 120 

GLGKPAVEGG DRAPDTALRP RAGQIQVGSS SACGASENEA GVRPVPPLAG ALARAGRRRT 180 

PHCRPCWLLG LGGLLQPAPR YHEAAGGRGG LHPARWGAQH RACGRRAARC ARAPAGRPRA 240 

RRGLQRPAVL GRTGAQAFPL HPGERAFAGF LLAVLRPRRS RKRHAAVGGG APTLLHRAEM 300 

RGTPGHRWGR ARSWKEMRCH LRANGYLCKY QFEVLCPAPR PGAASNLSYR APFQLHSAAL 360 

DFSPPGTEVS ALCRGQLPIS VTCIADEIGA RWDKLSGDVL CPCPGRYLRA GKCAELPNCL 420 

DDLGGFACEC ATGFELGKDG RSCVTSGEGQ PTLGGTGVPT RRPPATATSP VPQRTWPIRV 480 

DEKLGETPLV PEQDNSVTSI PEIPRWGSQS TMSTLQMSLQ AESKATITPS GSVISKFNST 540 

TSSATPQAFD SSSAVVFIFV STAVVVLVIL TMTVLGLVKL CFHESPSSQP RKESMGPPGL 600 
ESDPEPAALG SSSAHCTNNG VKVGDCDLRD RAEGALLAES PLGSSDA 






Seq ID NO: 344 DNA sequence 
Nucleic Acid Accession #: NM_012072 
Coding sequence: 149-2107 

1 11 21 31 41 51 

I I I I I I 

AAAGCCCTCA GCCTTTGTGT CCTTCTCTGC GCCGGAGTGG CTGCAGCTCA CCCCTCAGCT 60 

CCCCTTGGGG CCCAGCTGGG AGCCGAGATA GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 

TCCCGCAGAG GGCCACACAG AGACCGGGAT GGCCACCTCC ATGGGCCTGC TGCTGCTGCT 180 

GCTGCTGCTC CTGACCCAGC CCGGGGCGGG GACGGGAGCT GACACGGAGG CGGTGGTCTG 240 

CGTGGGGACC GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG AGGCCCAGAA 300 

CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 360 

CGTCCAGCGA GTACTGGCCC AGCTCCTGAG GCGGGAGGCA GCCCTGACGG CGAGGATGAG 420 

CAAGTTCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCCGCT 480 

GAAGGGCTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA 540 

GCTCCGGAAC TCGTGCATCT CCAAGCGCTG TGTGTCTCTG CTGCTGGACC TGTCCCAGCC 600 

GCTCCTTCCC AACCGCCTGC CCAAGTGGTC TGAGGGCCCC TGTGGGAGCC CAGGCTCCCC 660 

CGGAAGTAAC ATTGAGGGCT TCGTGTGCAA GTTCAGCTTC AAAGGCATGT GCCGGCCTCT 720 

GGCCCTGGGG GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCAGACCA CCAGTTCCTC 780 

CTTGGAGGCT GTGCCCTTTG CCTCTGCGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 840 

CGAGACTCAG AGTCATTATT TCGTGTGCAA GGAGAAGGCC CCCGATGTGT TCGACTGGGG 900 

CAGCTCGGGC CCCCTCTGTG TCAGCCCCAA GTATGGCTGC AACTTCAACA ATGGGGGCTG 960 

CCACCAGGAC TGCTTTGAAG GGGGGGATGG CTCCTTCCTC TGCGGCTGCC GACCAGGATT 1020 

CCGGCTGCTG GATGACCTGG TGACCTGTGC CTCTCGAAAC CCTTGCAGCT CCAGCCCATG 1080 

TCGTGGGGGG GCCACGTGCG TCCTGGGACC CCATGGGAAA AACTACACGT GCCGCTGCCC 1140 

CCAAGGGTAC CAGCTGGACT CGAGTCAGCT GGACTGTGTG GACGTGGATG AATGCCAGGA 1200 

CTCCCCCTGT GCCCAGGAGT GTGTCAACAC CCCTGGGGGC TTCCGCTGCG AATGCTGGGT 1260 

TGGCTATGAG CCGGGCGGTC CTGGAGAGGG GGCCTGTCAG GATGTGGATG AGTGTGCTCT 1320 

GGGTCGCTCG CCTTGCGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 1380 

TGAGGAGGGC TACGTCCTGG CCGGGGAGGA CGGGACTCAG TGCCAGGACG TGGATGAGTG 1440 

TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC AACACACAAG GGTCCTTCCA 1500 

CTGTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA CCATGGGGCC 1560 

TGTGTCTCTG GGACCACCAT CTGGGCCCCC CGATGAGGAG GACAAAGGAG AGAAAGAAGG 1620 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGG GGCCCCGAGG GCACCCCCAA 1680 

GGCTACACCC ACCACAAGTA GACCTTCGCT GTCATCTGAC GCCCCCATCA CATCTGCCCC 1740 

ACTCAAGATG CTGGCCCCCA GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 1800 

CGCCACAGCT GCCTCTGGCC CCCAGGAGCC TGCAGGTGGG GACTCCTCCG TGGCCACACA 1860 

AAACAACGAT GGCACTGACG GGCAAAAGCT GCTTTTATTC TACATCCTAG GCACCGTGGT 1920 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATCGCA AGCGGAGAGC 1980 

GAAGAGGGAG GAGAAGAAGG AGAAGAAGCC CCAGAATGCG GCAGACAGTT ACTCCTGGGT 2040 

TCCAGAGCGA GCTGAGAGCA GGGCCATGGA GAACCAGTAC AGTCCGACAC CTGGGACAGA 2100 

CTGCTGAAAG TGAGGTGGCC CTAGAGACAC TAGAGTCACC AGCCACCATC CTCAGAGCTT 2160 

TGAACTCCCC ATTCCAAAGG GGCACCCACA TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CAAACAATTG TAAGTCTCCT CCTTAAAGGC CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

TGTTTGATGT TCCTGAAGTG GAAGCTGTGT GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

TCTATAATGA TTGTTACTCC CCCTCCCTTT TCAAATTCCA ATGTGACCAA TTCCGGATCA 2400 

GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

ATCTAAGAGG AAAAGGTGAG TTGCTCATGC TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

CCTAGGATGA AAACTAAATC AATTAATTAT TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TCAAAGGGAA CATGTTCGGA CTGGAAACAT TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

AGCACAAGTC TTGCTAAATG TGATACTGTT GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TAACCTCTTA GGTGGCAAGG AGGCAGGAAG TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

CAGGTGTTTG TGAAGTCACA TAATCTACGG GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

CACAGATACT TGAATTAATT CATCCAAATG TACTGAGGTT ACCACACACT TGACTACGGA 2940 

TGTGATCAAC ACTAACAAGG AAACAAATTC AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CCTCAGACAC CCTGCCTGTG GCCCCGCCTC CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

TGAACGGGAG ATGATGCACT GTGTTTTGAA AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

TCATAGTCCA CAGTTGATGC AGCATCCTGA GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

CACACCAAGT AGGGAGCTAG TCAGGCAGTT TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

TTTTTACAGC AAAAACTGCT CAAAGCCATT TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TTGCAAATAT TTCTCCCTAT GATAATGCAG TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480 

TCTCTCTCAC ACACACACAC ACACACACAC ACACACACAC AGAGACACGG CACCATTCTG 3540 

CCTGGGGCAC TGGAACACAT TCCTGGGGGT CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

TGAGTATCTC TGGGAGGCCT CATGTCTCCT GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

ACAGACAGAG GAAATGTGTC TCCCTCCAAG GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

GGTTTTGCCT TAGCAATGCA TCGGTCTCTG AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

CAAGGTGCAG GGTTAATACT CTTGCCAGTT TTGAAATATA GATGCTATGG TTCAGATTGT 3840 

TTTTAATAGA AAACTAAAGG GGCAGGGGAA GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

TCGATGGGGC ATTTGGAACT TCTTTTTAAA GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AACTCTGGTG TTTAACACTT AAGGGAGACA AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

GGCCACGAGA CTCTAGGTGA TGTGTGAAGC TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CACTTAAATA AATGCAAATG CAACATTTCT CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TCATTTGGGG TGAAGGAGAC ATTTCTGTCC TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

GTATGATTCC TGGGATCCAA CGAGCCCTCC TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

GCCCAGGCCC ATCGTCTGTT CTCTGAATGC AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 

GAACCCCTCT GTGGAACCCA CAAGGGGAGA AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCCACCACGG GGGGAGAGCC AGCAACCCAA CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

AACCACTGGG CTCAAACACG TGCTTTATTC TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

ATGGAAATTC TTGTTTGGGG GATCTTGGGG CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680 

GCCAAGAGGC CATTAACAAA TCGTCCTTGT CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

CACAGTGGGG AATCCAAGGG TCACAGTATG GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800 

TCTCGCTAGA CACAGTGTTT CTGCCCAGGT GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4860 

CATGGGGACG GGGGAAGTTT TCACTTGGAG ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

CCAAATAGGT CAATAATTCT GGGAGACTCT TGGAAAAAAC TGAATATATT CAGGACCAAC 4980 

TCTCTCCCTC CCCTCATCCC ACATCTCAAA GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

CACCCAGCTC GCCATGCCTA CTCATTCCTG AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

CTTCTTTGTC ATTTGAGAAA GGATGCAGGA GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

CAGAAAAACC AGGGCAGGAC AGTTATCGAC AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

TAGAGGGACT CCACCCCTGC TCAACAGCTT GGCTTCCAGG CAAGACCAAC CTVCATCTGGT 5280 

CTCTGCCTTC GGTGGCCCAC ACAGCTAAGC GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

AACACATCTA CGTGTAGCAC TACGACGTTA TGTTTGGGTA ATGTGGGGAT GAACTGCATG 5400 

AGGCTCTGAT TAAGGATGTG GGGAAGTGGG CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGCCATCTTC CCTGCGATCA GGCAAAAAAG TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

TGTGTTATGT CCATTTTGCA GGATGAACTG AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

TTGCTTTGTC TTTTCCATCC TCATCACAAG CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

TCTTTCGATG GATGGAGATG ATCATTAGGT ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

ATTTCTGTGA AAACTAGGAG AACAGAGATG AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

TAACACAGTC TTTTTAAAAC TAACATAGGA AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

TCTCCATTGT CTAAATCAGG AAAACAGGAA AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

TTAATGCCCC CTACATATTT CCATCACCTT GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

TATGATCCCA GAAAACATCT GTCTCTACTT CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

TGGTTTGTGC ATTTTCTCAA CTATVAAATAG AGATGATAAT CCGAATTCTC CATATATTCA 6120 

CTAATCAAAG ACACTATTTT CATACTAGAT TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTTAAAAATA AATTGTGTTT TGGTCTGTTC TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

TTCAGCAGAT TTTGCCCACT ATTCCTCTGA GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

GCTTGAATTA GATCCCTGCA AAGGCTTGCT CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAATCACTT CATGAATGCT AAATGAGAAT GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTTGTTTGAC TAATTCTGGA ATTACAAGAT TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

ATGTTTCCCA AACTGTGAGG AGGGAAGGCT CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

CAAAATGGTG CTTTGAGGGT CAGCCTTTAG GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660 
TGTGTTATGT GCCTATCCTA ATAAACTCTT AAACACATT 

Seq ID NO: 345 Protein sequence 
Protein Accession #: NP_036204 

1 11 21 31 41 51 

I I I I I I 

MATSMGLLLL LLLLLTQPGA GTGADTEAVV CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60 

ATVKSKEEAQ HVQRVLAQLL RREAALTARM SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

EDTPYSNWHK ELRNSCISKR CVSLLLDLSQ PLLPNRLPKW SEGPCGSPGS PGSNIEGFVC 180 

KFSFKGMCRP LALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYFLC 240 

KEKAPDVFDW GSSGPLCVSP KYGCNFNNGG CHQDCFEGGD GSFLCGCRPG FRLLDDLVTC 300 

ASRNPCSSSP CRGGATCVLG PHGKNYTCRC PQGYQLDSSQ LDCVDVDECQ DSPCAQECVN 360 

TPGGFRCECW VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGE 420 

DGTQCQDVDE CVGPGGPLCD SLCFNTQGSF HCGCLPGWVL APNGVSCTMG PVSLGPPSGP 480 

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP KATPTTSRPS LSSDAPITSA PLKMLAPSGS 540 

SGVWREPSIH HATAASGPQE PAGGDSSVAT QNNDGTDGQK LLLFYILGTV VAILLLLALA 600 
LGLLVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 

Seq ID NO: 346 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: <1-966 

1 11 21 31 41 51 

I I I I I I 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 347 Protein sequence 
Protein Accession #: CAA83435 

1 11 21 31 41 51 

I I I I I I 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120 

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180 

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240 

LGSMGSVVKS EASSSPPVVT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300 
HYQSGPVPGT AINGTLPLSH M 

Seq ID NO: 348 DNA sequence 
Nucleotide Accession #: NM_002638 
Coding sequence: 120-473 

1 11 21 31 41 51 

I I I I I I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 

GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 

AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 

TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 

TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 

CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 

TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO: 349 Protein sequence 

Protein Accession #: NP_002629 

1 11 21 31 41 51 

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 350 DNA sequence 
Nucleic Acid Accession #: NM_007183 
Coding sequence: 75-2468 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGA CAGGACGTGA AGATAGTTGG GTTTGGAGGC GGCCGCCAGG CCCAGGCCCG 60 

GTGGACCTGC CGCCATGCAG GACGGTAACT TCCTGCTGTC GGCCCTGCAG CCTGAGGCCG 120 

GCGTGTGCTC CCTGGCGCTG CCCTCTGACC TGCAGCTGGA CCGCCGGGGC GCCGAGGGGC 180 

CGGAGGCCGA GCGGCTGCGG GCAGCCCGCG TCCAGGAGCA GGTCCGCGCC CGCCTCTTGC 240 

AGCTGGGACA GCAGCCGCGG CACAACGGGG CCGCTGAGCC CGAGCCTGAG GCCGAGACTG 300 

CCAGAGGCAC ATCCAGGGGG CAGTACCACA CCCTGCAGGC TGGCTTCAGC TCTCGCTCTC 360 

AGGGCCTGAG TGGGGACAAG ACCTCGGGCT TCCGGCCCAT CGCCAAGCCG GCCTACAGCC 420 

CAGCCTCCTG GTCCTCCCGC TCCGCCGTGG ATCTGAGCTG CAGTCGGAGG CTGAGTTCAG 480 

CCCACAATGG GGGCAGCGCC TTTGGGGCCG CTGGGTACGG GGGTGCCCAG CCCACCCCTC 540 

CCATGCCCAC CAGGCCCGTG TCCTTCCATG AGCGCGGTGG GGTTGGGAGC CGGGCCGACT 600 

ATGACACACT CTCCCTGCGC TCGCTGCGGC TGGGGCCCGG GGGCCTGGAC GACCGCTACA 660 

GCCTGGTGTC TGAGCAGCTG GAGCCCGCGG CCACCTCCAC CTACAGGGCC TTTGCGTACG 720 

AGCGCCAGGC CAGCTCCAGC TCCAGCCGGG CAGGGGGGCT GGACTGGCCC GAGGCCACTG 780 

AGGTTTCCCC GAGCCGGACC ATCCGTGCCC CTGCCGTGCG GACCCTGCAG CGATTCCAGA 840 

GCAGCCACCG GAGCCGCGGG GTAGGCGGGG CAGTGCCGGG GGCCGTCCTG GAGCCAGTGG 900 

CTCGAGCGCC ATCTGTGCGC AGCCTCAGCC TCAGCCTGGC TGACTCGGGC CACCTGCCGG 960 

ACGTGCATGG GTTCAACAGC TACGGTAGCC ACCGAACCCT GCAGAGACTC AGCAGCGGTT 1020 

TTGATGACAT TGACCTGCCC TCAGCAGTCA AGTACCTCAT GGCTTCAGAC CCCAACCTGC 1080 

AGGTGCTGGG AGCGGCCTAC ATCCAGCACA AGTGCTACAG CGATGCAGCC GCCAAGAAGC 1140 

AGGCCCGCAG CCTTCAGGCC GTGCCTAGGC TGGTGAAGCT CTTCAACCAC GCCAACCAGG 1200 

AAGTGCAGCG CCATGCCACA GGTGCCATGC GCAACCTCAT CTACGACAAC GCTGACAACA 1260 

AGCTGGCCCT GGTGGAGGAG AACGGGATCT TCGAGCTGCT GCGGACACTG CGGGAGCAGG 1320 

ATGATGAGCT TCGCAAAAAT GTCACAGGGA TCCTGTGGAA CCTTTCATCC AGCGACCACC 1380 

TGAAGGACCG CCTGGCCAGA GACACGCTGG AGCAGCTCAC GGACCTGGTG TTGAGCCCCC 1440 

TGTCGGGGGC TGGGGGTCCC CCCCTCATCC AGCAGAACGC CTCGGAGGCG GAGATCTTCT 1500 

ACAACGCCAC CGGCTTCCTC AGGAACCTCA GCTCTGCCTC TCAGGCCACT CGCCAGAAGA 1560 

TGCGGGAGTG CCACGGGCTG GTGGACGCCC TGGTCACCTC TATCAACCAC GCCCTGGACG 1620 

CGGGCAAATG CGAGGACAAG AGCGTGGAGA ACGCGGTGTG CGTCCTGCGG AACCTGTCCT 1680 

ACCGCCTCTA CGACGAGATG CCGCCGTCCG CGCTGCAGCG GCTGGAGGGT CGCGGCCGCA 1740 

GGGACCTGGC GGGGGCGCCG CCGGGAGAGG TCGTGGGCTG CTTCACGCCG CAGAGCCGGC 1800 

GGCTGCGCGA GCTGCCCCTC GCCGCCGATG CGCTCACCTT CGCGGAGGTG TCCAAGGACC 1860 

CCAAGGGCCT CGAGTGGCTG TGGAGCCCCC AGATCGTGGG GCTGTACAAC CGGCTGCTGC 1920 

AGCGCTGCGA GCTCAACCGG CACACGACGG AGGCGGCCGC CGGGGCGCTG CAGAACATCA 1980 

CGGCAGGCGA CCGCAGGTGG GCGGGGGTGC TGAGCCGCCT GGCCCTGGAG CAGGAGCGTA 2040 

TTCTGAACCC CCTGCTAGAC CGTGTCAGGA CCGCCGACCA CCACCAGCTG CGCTCACTGA 2100 

CTGGCCTCAT CCGAAACCTG TCTCGGAACG CTAGGAACAA GGACGAGATG TCCACGAAGG 2160 

TGGTGAGCCA CCTGATCGAG AAGCTGCCAG GCAGCGTGGG TGAGAAGTCG CCCCCAGCCG 2220 

AGGTGCTGGT CAACATCATA GCTGTGCTCA ACAACCTGGT GGTGGCCAGC CCCATCGCTG 2280 

CCCGAGACCT GCTGTATTTT GACGGACTCC GAAAGCTCAT CTTCATCAAG AAGAAGCGGG 2340 

ACAGCCCCGA CAGTGAGAAG TCCTCCCGGG CAGCATCCAG CCTCCTGGCC AACCTGTGGG 2400 

AGTACAACAA GCTCCACCGT GACTTTCGGG CGAAGGGCTA TCGGAAGGAG GACTTCCTGG 2460 

GCCCATAGGT GAAGCCTTCT GGAGGAGAAG GTGACGTGGC CCAGCGTCCA AGGGACAGAC 2520 

TCAGCTCCAG GCTGCTTGGC AGCCCAGCCT GGAGGAGAAG GCTAATGACG GAGGGGCCCC 2580 

TCGCTGGGGC CCCTGTGTGC ATCTTTGAGG GTCCTGGGCC ACCAGGAGGG GCAGGGTCTT 2640 

ATAGCTGGGG ACTTGGCTTC CGCAGGGCAG GGGGTGGGGC AGGGCTCAAG GCTGCTCTGG 2700 

TGTATGGGGT GGTGACCCAG TCACATTGGC AGAGGTGGGG GTTGGCTGTG GCCTGGCAGT 2760 

ATCTTGGGAT AGCCAGCACT GGGAATAAAG ATGGCCATGA ACAGTCACAA AAAAAAAAAA 2820 
AAAAGGAATT C 

Seq ID NO: 351 Protein sequence 
Protein Accession #: NP_009114.1 

1 11 21 31 41 51 

I I I I I I 

MQDGNFLLSA LQPEAGVCSL ALPSDLQLDR RGAEGPEAER LRAARVQEQV RARLLQLGQQ 60 

PRHNGAAEPE PEAETARGTS RGQYHTLQAG FSSRSQGLSG DKTSGFRPIA KPAYSPASWS 120 

SRSAVDLSCS RRLSSAHNGG SAFGAAGYGG AQPTPPMPTR PVSFHERGGV GSRADYDTLS 180 

LRSLRLGPGG LDDRYSLVSE QLEPAATSTY RAFAYERQAS SSSSRAGGLD WPEATEVSPS 240 

RTIRAPAVRT LQRFQSSHRS RGVGGAVPGA VLEPVARAPS VRSLSLSLAD SGHLPDVHGF 300 

NSYGSHRTLQ RLSSGFDDID LPSAVKYLMA SDPNLQVLGA AYIQHKCYSD AAAKKQARSL 360 

QAVPRLVKLF NHANQEVQRH ATGAMRNLIY DNADNKLALV EENGIFELLR TLREQDDELR 420 

KNVTGILWNL SSSDHLKDRL ARDTLEQLTD LVLSPLSGAG GPPLIQQNAS EAEIFYNATG 480 

FLRNLSSASQ ATRQKMRECH GLVDALVTSI NHALDAGKCE DKSVENAVCV LRNLSYRLYD 540 

EMPPSALQRL EGRGRRDLAG APPGEVVGCF TPQSRRLREL PLAADALTFA EVSKDPKGLE 600 

WLWSPQIVGL YNRLLQRCEL NRHTTEAAAG ALQNITAGDR RWAGVLSRLA LEQERILNPL 660 

LDRVRTADHH QLRSLTGLIR NLSRNARNKD EMSTKVVSHL IEKLPGSVGE KSPPAEVLVN 720 

IIAVLNNLVV ASPIAARDLL YFDGLRKLIF IKKKRDSPDS EKSSRAASSL LANLWQYNKL 780 

HRDFRAKGYR KEDFLGP 

Seq ID NO: 352 DNA sequence 
Nucleic Acid Accession #: M31469 
Coding sequence: 1-651 

1 11 21 31 41 51 

I I I I I I 

ATGGCTGCGC AGGGAGAGCC CCAGGTCCAG TTCAAACTTG TATTGGTTGG TGATGGTGGT 60 

ACTGGAAAAA CGACCTTCGT G/U^CGTCAT TTGACTGGTG AATTTGAGAA GAAGTATGTA 120 

GCCACCTTGG GTGTTGAGGT TCATCCCCTA GTGTTCCACA CCAACAGAGG ACCTATTAAG 180 

TTCAATGTAT GGGACACAGC CGGCCAGGAG AAATTCGGTG GACTGAGAGA TGGCTATTAT 240 

ATCCAAGCCC AGTGTGCCAT CATAATGTTT GATGTAACAT CGAGAGTTAC TTACAAGAAT 300 

GTGCCTAACT GGCATAGAGA TCTGGTACGA GTGTGTGAAA ACATCCCCAT TGTGTTGTGT 360 

GGCAACAAAG TGGATATTAA GGACAGGAAA GTGAAGGCGA AATCCATTGT CTTCCACCGA 420 

AAGAAGAATC TTCAGTACTA CGACATTTCT GCCAAAAGTA ACTACAACTT TGAAAAGCCC 480 

TTCCTCTGGC TTGCTAGGAA GCTCATTGGA GACCCTAACT TGGAATTTGT TGCCATGCCT 540 

GCTCTCGCCC CACCAGAAGT TGTCATGGAC CCAGCTTTGG CAGCACAGTA TGAGCACGAC 600 

TTAGAGGTTG CTCAGACAAC TGCTCTCCCG GATGAGGATG ATGACCTGTG A 

Seq ID NO: 353 Protein sequence 
Protein Accession #: AAA36546 

1 11 21 31 41 51 

I I I I I I 

MAAQGEPQVQ FKLVLVGDGG TGKTTFVKRH LTGEFEKKYV ATLGVEVHPL VFHTNRGPIK 60 

FNVWDTAGQE KFGGLRDGYY IQAQCAIIMF DVTSRVTYKN VPNWHRDLVR VCENIPIVLC 120 

GNKVDIKDRK VKAKSIVFHR KKNLQYYDIS AKSNYNFEKP FLWLARKLIG DPNLEFVAMP 180 

ALAPPEVVMD PALAAQYEHD LEVAQTTALP DEDDDL 

Seq ID NO: 354 DNA sequence 
Nucleic Acid Accession #: NM_002820 
Coding sequence: 304-831 

1 11 21 31 41 51 

I I I I I I 

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 

CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 

CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TGGCTATTAT 180 

TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG 240 

GTTTGGAGAA AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG 300 

ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT GAGCTACGCG 360 

GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG AGCTGTGTCT 420 

GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC 480 

CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC 540 

CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT 600 

GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660 

AAGACACCTG GGAAGAAAAA GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA GGAAAAGAAA 720 

AAACGGCGAA CTCGCTCTGC CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG 780 

GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT 840 

CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG 900 

GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA 960 

CAGAGAATAA CTCAGAATAT TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC 1020 

TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA 1080 

CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC 1140 

ATCTTCATAA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT 1200 

TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT 1260 

GATATTATCT ACAAACACTG CAGAACAGCA TCATGTCATA AACGATTCTG AGCCATTCAC 1320 

ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT 1380 

TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA 1440 

CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA 1500 

GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560 

CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA 

Seq ID NO: 355 Protein sequence 
Protein Accession #: NM_002820 

1 11 21 31 41 51 

I I I I I I 

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 60 

HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 120 

TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq ID NO: 356 DNA sequence 
Nucleic Acid Accession #: NM_017522 
Coding sequence: 1-2100 

1 11 21 31 41 51 

I I I I I I 

ATGGGCCTCC CCGAGCCGGG CCCTCTCCGG CTTCTGGCGC TGCTGCTGCT GCTGCTGCTG 60 

CTGCTGCTGC TGCGGCTCCA GCATCTTGCG GCGGCAGCGG CTGATCCGCT GCTCGGCGGC 120 

CAAGGGCCGG CCAAGGAGTG CGAAAAGGAC CAATTCCAGT GCCGGAACGA GCGCTGCATC 180 

CCCTCTGTGT GGAGATGCGA CGAGGACGAT GACTGCTTAG ACCACAGCGA CGAGGACGAC 240 

TGCCCCAAGA AGACCTGTGC AGACAGTGAC TTCACCTGTG ACAACGGCCA CTGCATCCAC 300 

GAACGGTGGA AGTGTGACGG CGAGGAGGAG TGTCCTGATG GCTCCGATGA GTCCGAGGCC 360 

ACTTGCACCA AGCAGGTGTG TCCTGCAGAG AAGCTGAGCT GTGGACCCAC CAGCCACAAG 420 

TGTGTACCTG CCTCGTGGCG CTGCGACGGG GAGAAGGACT GCGAGGGTGG AGCGGATGAG 480 

GCCGGCTGTG CTACCTCACT GGGCACCTGC CGTGGGGACG AGTTCCAGTG TGGGGATGGG 540 

ACATGTGTCC TTGCAATCAA GCACTGCAAC CAGGAGCAGG ACTGTCCAGA TGGGAGTGAT 600 

GAAGCTGGCT GCCTACAGGG GCTGAACGAG TGTCTGCACA ACAATGGCGG CTGCTCACAC 660 

ATCTGCACTG ACCTCAAGAT TGGCTTTGAA TGCACGTGCC CAGCAGGCTT CCAGCTCCTG 720 

GACCAGAAGA CTTGTGGCGA CATTGATGAG TGCAAGGACC CAGATGCCTG CAGCCAGATC 780 

TGTGTCAATT ACAAGGGCTA TTTTAAGTGT GAGTGCTACC CTGGCTGCGA GATGGACCTA 840 

CTGACCAAGA ACTGCAAGGC TGCTGCTGGC AAGAGCCCAT CCCTAATCTT CACCAACCGC 900 

ACGAGTGCGG AGGATCGACC TGTGAAGCGG AACTATTCAC GCCTCATCCC CATGCTCAAG 960 

AATGTCGTGG CACTAGATGT GGAAGTTGCC ACCAATCGCA TCTACTGGTG TGACCTCTCC 1020 

TACCGTAAGA TCTATAGCGC CTACATGGAC AAGGCCAGTG ACCCGAAAGA GCGGGAGGTC 1080 

CTCATTGACG AGCAGTTGCA CTCTCCAGAG GGCCTGGCAG TGGACTGGGT CCACAAGCAC 1140 

ATCTACTGGA CTGAGTCGGG CAATAAGACC ATCTCAGTGG CCACAGTTGA TGGTGGCCGC 1200 

CGACGCACTC TCTTCAGCCG TAACCTCAGT GAACCCCGGG CCATCGCTGT TGACCCCCTG 1260 

CGAGGGTTCA TGTATTGGTC TGACTGGGGG GACCAGGCCA AGATTGAGAA ATCTGGGCTC 1320 

AACGGTGTGG ACCGGCAAAC ACTGGTGTCA GACAATATTG AATGGCCCAA CGGAATCACC 1380 

CTGGATCTGC TGAGCCAGCG CTTGTACTGG GTAGACTCCA AGCTACACCA ACTGTCCAGC 1440 

ATTGACTTCA GTGGAGGCAA CAGAAAGACG CTGATCTCCT CCACTGACTT CCTGAGCCAC 1500 

CCTTTTGGGA TAGCTGTGTT TGAGGACAAG GTGTTCTGGA CAGACCTGGA GAACGAGGCC 1560 

ATTTTCAGTG CAAATCGGCT CAATGGCCTG GAAATCTCCA TCCTGGCTGA GAACCTCAAC 1620 

AACCCACATG ACATTGTCAT CTTCCATGAG CTGAAGCAGC CAAGAGCTCC AGATGCCTGT 1680 

GAGCTGAGTG TCCAGCCTAA TGGAGGCTGT GAATACCTGT GCCTTCCTGC TCCTCAGATC 1740 

TCCAGCCACT CTCCCAAGTA CACATGTGCC TGTCCTGACA CAATGTGGCT GGGTCCAGAC 1800 

ATGAAGAGGT GCTACCGAGA TGCAAATGAA GACAGTAAGA TGGGCTCAAC AGTCACTGCC 1860 

GCTGTTATCG GGATCATCGT GCCCATAGTG GTGATAGCCC TCCTGTGCAT GAGTGGATAC 1920 

CTGATCTGGA GAAACTGGAA GCGGAAGAAC ACCAAAAGCA TGAATTTTGA CAACCCAGTC 1980 

TACAGGAAAA CAACAGAAGA AGAAGATGAA GATGAGCTCC ATATAGGGAG AACTGCTCAG 2040 

ATTGGCCATG TCTATCCTGC ACGAGTGGCA TTAAGCCTTG AAGATGATGG ACTACCCTGA 2100 

GGATGGGATC ACCCCCTTCG TGCCTCATGG AATTCAGTCC CATGCACTAC ACTCCGGATG 2160 

GTGTATGACT GGATGAATGG GTTTCTATAT ATGGGTCTGT GTGAGTGTAT GTGTGTGTGT 2220 

GATTTTTTTT TTTAAATTTA TGTTGCGGAA AGGTAACCAC AAAGTTATGA TGAACTGCAA 2280 

ACATCCAAAG GATGTGAGAG TTTTTCTATG TATAATGTTT TATACACTTT TTAACTGGTT 2340 

GCACTACCCA TGAGGAATTC GTGGAATGGC TACTGCTGAC TAACATGATG CACATAACCA 2400 

AATGGGGGCC AATGGCACAG TACCTTACTC ATCATTTAAA AACTATATTT ACAGAAGATG 2460 

TTTGGTTGCT GGGGGGCTTT TTTAGGTTTT GGGCATTTGT TTTTTGTAAA TAAGATGATT 2520 
ATGCTTTGTG GCTATCCATC AACATAAGT 






Seq ID NO: 357 Protein sequence 
Protein Accession #: NP_059992 

1 11 21 31 41 51 

I I I I I I 

MGLPEPGPLR LLALLLLLLL LLLLRLQHLA AAAADPLLGG QGPAKECEKD QFQCRNERCI 60 

PSVWRCDEDD DCLDHSDEDD CPKKTCADSD FTCDNGHCIH ERWKCDGEEE CPDGSDESEA 120 

TCTKQVCPAE KLSCGPTSHK CVPASWRCDG EKDCEGGADE AGCATSLGTC RGDEFQCGDG 180 

TCVLAIKHCN QEQDCPDGSD EAGCLQGLNE CLHNNGGCSH ICTDLKIGFE CTCPAGFQLL 240 

DQKTCGDIDE CKDPDACSQI CVNYKGYFKC ECYPGCEMDL LTKNCKAAAG KSPSLIFTNR 300 

TSAEDRPVKR NYSRLIPMLK NVVALDVEVA TNRIYWCDLS YRKIYSAYMD KASDPKEREV 360 

LIDEQLHSPE GLAVDWVHKH IYWTDSGNKT ISVATVDGGR RRTLFSRNLS EPRAIAVDPL 420 

RGFMYWSDWG DQAKIEKSGL NGVDRQTLVS DNIEWPNGIT LDLLSQRLYW VDSKLHQLSS 480 

IDFSGGNRKT LISSTDFLSH PFGIAVFEDK VFWTDLENEA IFSANRLNGL EISILAENLN 540 

NPHDIVIFHE LKQPRAPDAC ELSVQPNGGC EYLCLPAPQI SSHSPKYTCA CPDTMWLGPD 600 

MKRCYRDANE DSKMGSTVTA AVIGIIVPIV VIALLCMSGY LIWRNWKRKN TKSMNFDNPV 660 
YRKTTEEEDE DELHIGRTAQ IGHVYPARVA LSLEDDGLP 



Seq ID NO: 358 DNA sequence 
Nucleic Acid Accession #: M27826 
Coding sequence: <1-503 

1 11 21 31 41 51 

I I I I I I 

AGCCCAAGAA ACATCTCACC AATTTCAAAT CTGATCTATT CGGCTTAGCG ACTGAAGATT 60 

GACGCTGCCC GATCGCCTCG GAAGTCCCCT GGACCATCAC AGAAGCCGAG CTTCGGGTAA 120 

CTCTCACAGT GGAGGGTAAG TCCATCCCCT GTTT7UVTCGA TACGGGGGCT ACCCACTCCA 180 

CGTTGCCTTC TTTTCAAGGG CCTGTTTCCC TTGCCCCCAT AACTGTTGTG GGTATTGACG 240 

GCCAAGCTTC TUyy^CCCCTG AAAACTCCCC CACTCTGGTG CCAACTTGGA CAACACTCTT 300 

TTATGCACTC TTTTTTAGTT ATCCCCACCT GCCCACTTCC CTTATTAGGC CGAAATATTT 360 

TAACCAAATT ATCTGCTTCC CTGACTATTC CTGGAGTACA GCTACATCTC ATTGCTGCCC 420 

TTCTTCCCAA TCCAAAGCCT CCTTTGTGTC CTCTAACATC CCCACT^TAT CAGCCCTTAC 480 

CACAAGACCT CCCTTCAGCT TAATCTCTCC CACTCTAGGT TCCCACGCCG CCCCTAATCC 540 

CACTTGAAGC AGCCCTGAGA AACATCGCCC ATTCTCTCTC CATACCACCC CCCAAAAATT 600 

TTCGCCGCTC CAACACTTCA ACACTATTTT GTTTTATTTG TCTTATTAAT ATCAGAAGGC 660 
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AGGAATGTCA GGCCTCTGAG CCCAGGCCAG GCCATCGCAT 
ACATCCAGAT GGCCTGAAGT AACTGAAGAT CCACAAAAGA 
ATGACATTCC ACCATTGTGA TTTGTTCCTG CCCCACCCTA 
ATCTCCCCCA CCCTTAAGAA GGTTCTTTGT AATTCTCCCC 
TGAGATCCAC CCCTGCCCAC CAGAGAACAA CCCCCTTTGA 
CCCAAATCCT ATAAAACAGC CCCACCCCTA TCTTCCTTCA 
AGCCACCGGC ACCCAGGTGA AATAAACAGC TTTATTGCTC 



PCT/US02/12476 









A6TAAAAACA GCCTTAACTG 780 
ACTGATCAAT GTACTTTGTA 840 
ACCCTTGAGA ATGTACTTTG 900 
TTGTAATTTT TTATTACCTT 960 
CTGACTCTCT TTTCGGACTC 1020 
AC 
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Seq ID NO: 359 Protein sequence 
Protein Accession #: AAA55999 



1 11 21 31 41 51 

I I I I I I 

PKKHLTNFKS DLFSLATEDW RCPIASEVPW TITEAELRVT LTVEGKSIPC LIDTGATHST 60 

LPSFQGPVSL APITVVGIDG QASKPLKTPP LHCQLGQHSF MHSFLVIPTC PLPLLGRNIL 120 

TKLSASLTIP GVQLHLIAAL LPNPKPPLCP LTSPQYQPLP QDLPSA 



Seq ID NO: 360 DNA sequence 
Nucleic Acid Accession #: NM_001854 
Coding sequence: 162-5582 



1 11 21 31 41 51 

I I I I I I 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 
GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 
GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 
AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1140 
GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 
CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 
CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 
GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 
CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGTU^GGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
ACCCTGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 
AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 
GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 
ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
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GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 

CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 

GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 

CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 

ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 

GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200 

CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 

GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 

AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 

AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 

AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 

ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 

GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 

GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 

CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 

ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 

AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 

GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 

ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 

AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 

AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 

GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 

GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 

TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 

ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCT 5580 

AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 

TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 

TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAG7VAT CACAGACT^ AGCTTTGAAA 5760 

ATCATAAAGA TATiU\GTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 

TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 

AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 

TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

CCAA7VACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 

GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTJ^AGTCCC 6120 
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 361 Protein sequence 
Protein Accession #: NP_001845 

1 11 21 31 41 51 

I I I I I I 

MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 

CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 

EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 

DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240 

PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 

ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 

AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 

GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 

GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 

GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 

GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 

GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 

GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 

GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 

GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 

GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 

GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 

GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140 

GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 

GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 

GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 

GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 

GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 

GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 

GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 

MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 

EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 

LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 

EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 



Seq ID NO: 362 DNA sequence 
Nucleic Acid Accession #: NM_003107 
Coding sequence: 351-1775 

1 11 21 31 41 51 

I I I I I I 

TTCCCCAGCA TTCGAGAAAC TCCTCTCTAC TTTAGCACGG TCTCCAGACT CAGCCGAGAG 60 
ACAGCAAACT GCAGCGCGGT GAGAGAGCGA GAGAGAGGGA GAGAGAGACT CTCCAGCCTG 120 
GGAACTATAA CTCCTCTGCG AGAGGCGGAG AACTCCTTCC CCAAATCTTT TGGGGACTTT 180 
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TCTCXCTTTA CCCACCTCCG CCCCTGCGAG GAGTTGAGGG GCCAGTTCGG CCGCCGCGCG 240 

CGTCTTCCCG TTCGGCGTGT GCTTGGCCCG GGGAACCGGG AGGGCCCGGC GATCGCGCGG 300 

CGGCCGCCGC GAGGGTGTGA GCGCGCGTGG GCGCCCGCCG AGCCGAGGCC ATGGTGCAGC 360 

AAACCAACAA TGCCGAGAAC ACGGAAGCGC TGCTGGCCGG CGAGAGCTCG GACTCGGGCG 420 

CCGGCCTCGA GCTGGGAATC GCCTCCTCCC CCACGCCCGG CTCCACCGCC TCCACGGGCG 480 

GCAAGGCCGA CGACCCGAGC TGGTGCAAGA CCCCGAGTGG GCACATCAAG CGACCCATGA 540 

ACGCCTTCAT GGTGTGGTCG CAGATCGAGC GGCGCAAGAT CATGGAGCAG TCGCCCGACA 600 

TGCACAACGC CGAGATCTCC AAGCGGCTGG GCAAACGCTG GAAGCTGCTC ATAGACAGCG 660 

ACAAGATCCC TTTCATTCGA GAGGCGGAGC GGCTGCGCCT CAAGCACATG GCTGACTACC 720 

CCGACTACAA GTACCGGCCC AGGAAGAAGG TGAAGTCCGG CAACGCCAAC TCCAGCTCCT 780 

CGGCCGCCGC CTCCTCCAAG CCGGGGGAGA AGGGAGACAA GGTCGGTGGC AGTGGCGGGG 840 

GCGGCCATGG GGGCGGCGGC GGCGGCGGGA GCAGCAACGC GGGGGGAGGA GGCGGCGGTG 900 

CGAGTGGCGG CGGCGCCAAC TCCAAACCGG CGCAGAAAAA GAGCTGCGGC TCCAAAGTGG 960 

CGGGCGGCGC GGGCGGTGGG GTTAGCAAAC CGCACGCCAA GCTCATCCTG GCAGGCGGCG 1020 

GCGGCGGCGG GAAAGCAGCG GCTGCCGCCG CCGCCTCCTT CGCCGCCGAA CAGGCGGGGG 1080 

CCGCCGCCCT GCTGCCCCTG GGCGCCGCCG CCGACCACCA CTCGCTGTAC AAGGCGCGGA 1140 

CTCCCAGCGC CTCGGCCTCC GCCTCCTCGG CAGCCTCGGC CTCCGCAGCG CTCGCGGCCC 1200 

CGGGCAAGCA CCTGGCGGAG AAGAAGGTGA AGCGCGTCTA CCTGTTCGGC GGCCTGGGCA 1260 

CGTCGTCGTC GCCCGTGGGC GGCGTGGGCG CGGGAGCCGA CCCCAGCGAC CCCCTGGGCC 1320 

TGTACGAGGA GGAGGGCGCG GGCTGCTCGC CCGACGCGCC CAGCCTGAGC GGCCGCAGCA 1380 

GCGCCGCCTC GTCCCCCGCC GCCGGCCGCT CGCCCGCCGA CCACCGCGGC TACGCCAGCC 1440 

TGCGCGCCGC CTCGCCCGCC CCGTCCAGCG CGCCCTCGCA CGCGTCCTCC TCGGCCTCGT 1500 

CCCACTCCTC CTCTTCCTCC TCCTCGGGCT CCTCGTCCTC CGACGACGAG TTCGAAGACG 1560 

ACCTGCTCGA CCTGAACCCC AGCTCAAACT TTGAGAGCAT GTCCCTGGGC AGCTTCAGTT 1620 

CGTCGTCGGC GCTCGACCGG GACCTGGATT TTAACTTCGA GCCCGGCTCC GGCTCGCACT 1680 

TCGAGTTCCC GGACTACTGC ACGCCCGAGG TGAGCGAGAT GATCTCGGGA GACTGGCTCG 1740 

AGTCCAGCAT CTCCAACCTG GTTTTCACCT ACTGAAGGGC GCGCAGGCAG GGAGAAGGGC 1800 

CGGGGGGGGT AGGAGAGGAG AAAAAAAAAG TGAAAAAAAG AAACGAAAAG GACAGACGAA 1860 

GAGTTTAAAG AGAAAAGGGA AAAAAGAAAG AAAAAGTAAG CAGGGCTCGT TCGCCCGCGT 1920 

TCTCGTCGTC GGATCAAGGA GGGCGGCGGC GTTTTGGACC CGCGCTCCCA TCCCCCACCT 1980 

TCCCGGGCCG GGGACCCACT CTGCCCAGCC GGAGGGACGC GGAGGAGGAA GAGGGTAGAC 2040 

AGGGGCGACC TGTGATTGTT GTTATTGATG TTGTTGTTGA TGGCAAAAAA AAAAAGCGAC 2100 

TTCGAGTTTG CTCCCCTTTG CTTGAAGAGA CCCCCTCCCC CTTCCAACGA GCTTCCGGAC 2160 

TTGTCTGCAC CCCCAGCAAG J^GGCGAGTT AGTTTTCTAG AGACTTGAAG GAGTCTCCCC 2220 

CTTCCTGCAT CACCACCTTG GTTTTGTTTT ATTTTGCTTC TTGGTCAAGA AAGGAGGGGA 2280 

GAACCCAGCG CACCCCTCCC CCCCTTTTTT TAAACGCGTG ATGAAGACAG AAGGCTCCGG 2340 

GGTGACGAAT TTGGCCGATG GCAGATGTTT TGGGGGAACG CCGGGACTGA GAGACTCCAC 2400 

GCAGGCGAAT TCCCGTTTGG GGCCTTTTTT TCCTCCCTCT TTTCCCCTTG CCCCCTCTGC 2460 

AGCCGGAGGA GGAGATGTTG AGGGGAGGAG GCCAGCCAGT GTGACCGGCG CTAGGAAATG 2520 

ACCCGAGAAC CCCGTTGGAA GCGCAGCAGC GGGAGCTAGG GGCGGGGGCG GAGGAGGACA 2580 

CGAACTGGAA GGGGGTTCAC GGTCAAACTG AAATGGATTT GCACGTTGGG GAGCTGGCGG 2640 

CGGCGGCTGC TGGGCCTCCG CCTTCTTTTC TACGTGAAAT CAGTGAGGTG AGACTTCCCA 2700 

GACCCCGGAG GCGTGGAGGA GAGGAGACTG TTTGATGTGG TACAGGGGCA GTCAGTGGAG 2760 
GGCGAGTGGT TTCGGAAAAA AAAAAAGAAA AAAAGGG 

Seq ID NO: 363 Protein sequence 
Protein Accession #: NP_003098 
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50 
1 11 21 31 41 51 

I I I I I I 

MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS WCKTPSGHIK 60 
RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL KDSDKIPFIR EAERLRLKHM 120 
ADYPDYKYRP RKKVKSGNAN SSSSAAASSK PGEKGDKVGG SGGGGHGGGG GGGSSNAGGG 180 
GGGASGGGAN SKPAQKKSCG SKVAGGAGGG VSKPHAKLIL AGGGGGGKAA AAAAASFAAE 240 
QAGAAALLPL GAAADHHSLY KARTPSASAS ASSAASASAA LAAPGKHLAE KKVKRVYLFG 300 
GLGTSSSPVG GVGAGADPSD PLGLYEEEGA GCSPDAPSLS GRSSAASSPA AGRSPADHRG 360 
YASLRAASPA PSSAPSHASS SASSHSSSSS SSGSSSSDDE FEDDLLDLNP SSNFESMSLG 420 
SFSSSSALDR DLDFNFEPGS GSHFEFPDYC TPEVSEMISG DWLESSISNL VFTY 



Seq ID NO: 364 DNA sequence 
Nucleic Acid Accession #: U10860 
Coding sequence: 123-2204 

1 11 21 31 41 51 

I I I I I I 

TGCCGGCTGC TCCTCGACCA GGCCTCCTTC TCAACCTCAG CGGGCGGCGC CGACCCTTCC 60 
GGCACCCTCC CGCCCCGTCT CGTACTGTCG CCGTCACCQC CGCGGCTCCG GCCCTGGCCC 120 
CGATGGCTCT GTGCAACGGA GACTCCAAGC TGGAGAATGC TGGAGGAGAC CTTAAGGATG 180 
GCCACCACCA CTATGAAGGA GCTGTTGTCA TTCTGGATGC TGGTGCTCAG TACGGGAAAG 240 
TCATAGACCG AAGAGTGAGG GAACTGTTCG TGCAGTCTGA AATTTTCCCC TTGGAAACAC 300 
CAGCATTTGC TATAAAGGAA CAAGGATTCC GTGCTATTAT CATCTCTGGA GGACCTAATT 360 
CTGTGTATGC TGAAGATGCT CCCTGGTTTG ATCCAGCAAT ATTCACTATT GGCAAGCCTG 420 
TTCTTGGAAT TTGCTATGGT ATGCAGATGA TGAATAAGGT ATTTGGAGGT ACTGTGCACA 480 
AAAAAAGTGT CAGAGAAGAT GGAGTTTTCA ACATTAGTGT GGATAATACA TGTTCATTAT 540 
TCAGGGGCCT TCAGAAGGAA GAAGTTGTTT TGCTTACACA TGGAGATAGT GTAGACAAAG 600 
TAGCTGATGG ATTCAAGGTT GTGGCACGTT CTGGAAACAT AGTAGCAGGC ATAGCAAATG 660 
AATCTAAAAA GTTATATGGA GCACAGTTCC ACCCTGAAGT TGGCCTTACA GAAAATGGAA 720 
AAGTAATACT GAAGAATTTC CTTTATGATA TAGCTGGATG CAGTGGAACC TTCACCGTGC 780 
AGAACAGAGA ACTTGAGTGT ATTCGAGAGA TCAAAGAGAG AGTAGGCACG TCAAAAGTTT 840 
TGGTTTTACT CAGTGGTGGA GTAGACTCAA CAGTTTGTAC AGCTTTGCTA AATCGTGCTT 900 
TGAACCAAGA ACAAGTCATT GCTGTGCACA TTGATAATGG CTTTATGAGA AAACGAGAAA 960 
GCCAGTCTGT TGAAGAGGCC CTCAAAAAGC TTGGAATTCA GGTCAAAGTG ATAAATGCTG 1020 
CTCATTCTTT CTACAATGGA ACAACAACCC TACCAATATC AGATGAAGAT AGAACCCCAC 1080 
GGAAAAGAAT TAGCAAAACG TTAAATATGA CCACAAGTCC TGAAGAGAAA AGAAA7U\TCA 1140 
TTGGGGATAC TTTTGTTAAG ATTGCCAATG AAGTAATTGG AGAAATGAAC TTGAAACCAG 1200 
AGGAGGTTTT CCTTGCCCAA GGTACTTTAC GGCCTGATCT AATTGAAAGT GCATCCCTTG 1260 



WO 02/086443 

TTGCAAGTGG CAAAGCTGAA CTCATCAAAA CCCATCACAA TGACACAGAG CTCATCAGAA 1320 

AGTTGAGAGA GGAGGGAAAA GTAATAGAAC CTCTGAAAGA TTTTCATAAA GATGAAGTGA 1380 

GAATTTTGGG CAGAGAACTT GGACTTCCAG AAGAGTTAGT TTCCAGGCAT CCATTTCCAG 1440 

GTCCTGGCCT GGCAATCAGA GTAATATGTG CTGAAGAACC TTATATTTGT AAGGACTTTC 1500 

CTGTWW^CCAA CAATATTTTG AAAATAGTAG CTGATTTTTC TGCAAGTGTT AAAAAGCCAC 1560 

ATACCCTATT ACAGAGAGTC AAAGCCTGCA CAACAGAAGA GGATCAGGAG AAGCTGATGC 1620 

AAATTACCAG TCTGCATTCA CTGAATGCCT TCTTGCTGCC AATTAAAACT GTAGGTGTGC 1680 

AGGGTGACTG TCGTTCCTAC AGTTACGTGT GTGGAATCTC CAGTAAAGAT GAACCTGACT 1740 

GGGAATCACT TATTTTTCTG GCTAGGCTTA TACCTCGCAT GTGTCACAAC GTTAACAGAG 1800 

TTGTTTATAT ATTTGGCCCA CCAGTTAAAG AACCTCCTAC AGATGTTACT CCCACTTTCT 1860 

TGACAACAGG GGTGCTCAGT ACTTTACGCC AAGCTGATTT TGAGGCCCAT AACATTCTCA 1920 

GGGAGTCTGG GTATGCTGGG AAAATCAGCC AGATGCCGGT GATTTTGACA CCATTACATT 1980 

TTGATCGGGA CCCACTTCAA AAGCAGCCTT CATGCCAGAG ATCTGTGGTT ATTCGAACCT 2040 

TTATTACTAG TGACTTCATG ACTGGTATAC CTGCAACACC TGGCAATGAG ATCCCTGTAG 2100 

AGGTGGTATT AAAGATGGTC ACTGAGATTA AGAAGATTCC TGGTATTTCT CGAATTATGT 2160 
ATGACTTAAC ATCAAAGCCC CCAGGAACTA CTGAGTGGGA GTAATAAACT TC 

Seq ID NO: 365 Protein sequence 
Protein Accession #: AAA60331 

1 11 21 31 41 51 

I I I I I I 

MALCNGDSKL ENAGGDLKDG HHHYEGAVVI LDAGAQYGKV IDRRVRELFV QSEIFPLETP 60 

AFAIKEQGFR AIIISGGPNS VYAEDAPWFD PAIFTIGKPV LGICYGMQMM NKVFGGTVHK 120 

KSVREDGVFN ISVDNTCSLF RGLQKEEVVL LTHGDSVDKV ADGFKVVARS GNIVAGIANE 180 

SKKLYGAQFH PEVGLTENGK VILKNFLYDI AGCSGTFTVQ NRELECIREI KERVGTSKVL 240 

VLLSGGVDST VCTALLNRAL NQEQVIAVHI DNGFMRKRES QSVEEALKKL GIQVKVINAA 300 

HSFYNGTTTL PISDEDRTPR KRISKTLNMT TSPEEKRKII GDTFVKIANE VIGEMNLKPE 360 

EVFLAQGTLR PDLIESASLV ASGKAELIKT HHNDTELIRK LREEGKVIEP LKDFHKDEVR 420 

ILGRELGLPE ELVSRHPFPG PGLAIRVICA EEPYICKDFP ETNNILKIVA DFSASVKKPH 480 

TLLQRVKACT TEEDQEKLMQ ITSLHSLNAF LLPIKTVGVQ GDCRSYSYVC GISSKDEPDW 540 

ESLIFLARLI PRMCHNVNRV VYIFGPPVKE PPTDVTPTFL TTGVLSTLRQ ADFEAHNILR 600 

ESGYAGKISQ MPVILTPLHF DRDPLQKQPS CQRSVVIRTF ITSDFMTGIP ATPGNEIPVE 660 
VVLKMVTEIK KIPGISRIMY DLTSKPPGTT EWE 



Seq ID NO: 366 DNA sequence 
Nucleic Acid Accession #: NM_004219 
Coding sequence: 46-654 

1 11 21 31 41 51 

I I I I I I 

GCGGCCTCAG ATGAATGCGG CTGTTAAGAC CTGCAATAAT CCAGAATGGC TACTCTGATC 60 
TATGTTGATA AGGAAAATGG AGAACCAGGC ACCCGTGTGG TTGCTAAGGA TGGGCTGAAG 120 
CTGGGGTCTG GACCTTCAAT CAAAGCCTTA GATGGGAGAT CTCAAGTTTC AACACCACGT 180 
TTTGGCAAAA CGTTCGATGC CCCACCAGCC TTACCTAAAG CTACTAGAAA GGCTTTGGGA 240 
ACTGTCAACA GAGCTACAGA AAAGTCTGTA AAGACCAAGG GACCCCTCAA ACAAAAACAG 300 
CCAAGCTTTT CTGCCAAAAA GATGACTGAG AAGACTGTTA AAGCAAAAAG CTCTGTTCCT 360 
GCCTCAGATG ATGCCTATCC AGAAATAGAA AAATTCTTTC CCTTCAATCC TCTAGACTTT 420 
GAGAGTTTTG ACCTGCCTGA AGAGCACCAG ATTGCGCACC TCCCCTTGAG TGGAGTGCCT 480 
CTCATGATCC TTGACGAGGA GAGAGAGCTT GAAAAGCTGT TTCAGCTGGG CCCCCCTTCA 540 
CCTGTGAAGA TGCCCTCTCC ACCATGGGAA TCCAATCTGT TGCAGTCTCC TTCAAGCATT 600 
CTGTCGACCC TGGATGTTGA ATTGCCACCT GTTTGCTGTG ACATAGATAT TTAAATTTCT 660 
TAGTGCTTCA GAGTTTGTGT GTATTTGTAT TAATAAAGCA TTCTTCAACA GAAAAAAAAA 720 
AAATU^AAA 

Seq ID NO: 367 Protein sequence 
Protein Accession #: NP_004210 

1 11 21 31 41 51 

I I I I I I 

MATLIYVDKE NGEPGTRVVA KDGLKLGSGP SIKALDGRSQ VSTPRFGKTF DAPPALPKAT 60 
RKALGTVNRA TEKSVKTKGP LKQKQPSFSA KKMTEKTVKA KSSVPASDDA YPEIEKFFPF 120 
NPLDFESFDL PEEHQIAHLP LSGVPLMILD EERELEKLFQ LGPPSPVKMP SPPWESNLLQ 180 
SPSSILSTLD VELPPVCCDI DI 



Seq ID NO: 368 DNA sequence 
Nucleic Acid Accession #: NM_000597 
Coding sequence: 118-1104 

1 11 21 31 41 51 

I I I I I I 

ATTCGGGGCG AGGGAGGAGG AAGAAGCGGA GGAGGCGGCT CCCGCTCGCA GGGCCGTGCA 60 

CCTGCCCGCC CGCCCGCTCG CTCGCTCGCC CGCCGCGCCG CGCTGCCGAC CGCCAGCATG 120 

CTGCCGAGAG TGGGCTGCCC CGCGCTGCCG CTGCCGCCGC CGCCGCTGCT GCCGCTGCTG 180 

CCGCTGCTGC TGCTGCTACT GGGCGCGAGT GGCGGCGGCG GCGGGGCGCG CGCGGAGGTG 240 

CTGTTCCGCT GCCCGCCCTG CACACCCGAG CGCCTGGCCG CCTGCGGGCC CCCGCCGGTT 300 

GCGCCGCCCG CCGCGGTGGC CGCAGTGGCC GGAGGCGCCC GCATGCCATG CGCGGAGCTC 360 

GTCCGGGAGC CGGGCTGCGG CTGCTGCTCG GTGTGCGCCC GGCTGGAGGG CGAGGCGTGC 420 

GGCGTCTACA CCCCGCGCTG CGGCCAGGGG CTGCGCTGCT ATCCCCACCC GGGCTCCGAG 480 

CTGCCCCTGC AGGCGCTGGT CATGGGCGAG GGCACTTGTG AGAAGCGCCG GGACGCCGAG 540 

TATGGCGCCA GCCCGGAGCA GGTTGCAGAC AATGGCGATG ACCACTCAGA AGGAGGCCTG 600 

GTGGAGAACC ACGTGGACAG CACCATGAAC ATGTTGGGCG GGGGAGGCAG TGCTGGCCGG 660 

AAGCCCCTCA AGTCGGGTAT GAAGGAGCTG GCCGTGTTCC GGGAGAAGGT CACTGAGCAG 720 

CACCGGCAGA TGGGCAAGGG TGGCAAGCAT CACCTTGGCC TGGAGGAGCC CAAGAAGCTG 780 

CGACCACCCC CTGCCAGGAC TCCCTGCCAA CAGGAACTGG ACCAGGTCCT GGAGCGGATC 840 




TCCACCATGC GCCTTCCGGA TGAGCGGGGC CCTCTGGAGC ACCTCTACTC CCTGCACATC 900 

CCCAACTGTG ACAAGCATGG CCTGTACAAC CTCAAACAGT GCAAGATGTC TCTGAACGGG 960 

CAGCGTGGGG AGTGCTGGTG TGTGAACCCC AACACCGGGA AGCTGATCCA GGGAGCCCCC 1020 

ACCATCCGGG GGGACCCCGA GTGTCATCTC TTCTACAATG AGCAGCAGGA GGCTTGCGGG 1080 

GTGCACACCC AGCGGATGCA GTAGACCGCA GCCAGCCGGT GCCTGGCGCC CCTGCCCCCC 1140 

GCCCCTCTCC AAACACCGGC AGAAAACGGA GAGTGCTTGG GTGGTGGGTG CTGGAGGATT 1200 

TTCCAGTTCT GACACACGTA TTTATATTTG GAAAGAGACC AGCACCGAGC TCGGCACCTC 1260 

CCCGGCCTCT CTCTTCCCAG CTGCAGATGC CACACCTGCT CCTTCTTGCT TTCCCCGGGG 1320 

GAGGAAGGGG GTTGTGGTCG GGGAGCTGGG GTACAGGTTT GGGGAGGGGG AAGAGAAATT 1380 
TTTATTTTTG AACCCCTGTG TCCCTTTTGC ATAAGATTAA AGGAAGGAAA AGT 






Seq ID NO: 369 Protein sequence 
Protein Accession #: NP_000588 



1 11 21 31 41 51 

I I I I I I 

MLPRVGCPAL PLPPPPLLPL LPLLLLLLGA SGGGGGARAE VLFRCPPCTP ERLAACGPPP 60 
VAPPAAVAAV AGGARMPCAE LVREPGCGCC SVCARLEGEA CGVYTPRCGQ GLRCYPHPGS 120 
ELPLQALVMG EGTCEKRRDA EYGASPEQVA DNGDDHSEGG LVENHVDSTM NMLGGGGSAG 180 
RKPLKSGMKE LAVFREKVTE QHRQMGKGGK HHLGLEEPKK LRPPPARTPC QQELDQVLER 240 
ISTMRLPDER GPLEHLYSLH IPNCDKHGLY NLKQCKMSLN GQRGECWCVN PNTGKLIQGA 300 
PTIRGDPECH LFYNEQQEAC GVHTQRMQ 



Seq ID NO: 370 DNA sequence 
Nucleic Acid Accession #: NM_004264 
Coding sequence: 6-440 

1 11 21 31 41 51 

I I I I I I 

GGAACATGGC GGATCGGCTC ACGCAGCTTC AGGACGCTGT GAATTCGCTT GCAGATCAGT 60 
TTTGTAATGC CATTGGAGTA TTGCAGCAAT GTGGTCCTCC TGCCTCTTTC AATAATATTC 120 
AGACAGCAAT TAACAAAGAC CAGCCAGCTA ACCCTACAGA AGAGTATGCC CAGCTTTTTG 180 
CAGCACTGAT TGCACGAACA GCAAAAGACA TTGATGTTTT GATAGATTCC TTACCCAGTG 240 
AAGAATCTAC AGCTGCTTTA CAGGCTGCTA GCTTGTATAA GCTAGAAGAA GAAAACCATG 300 
AAGCTGCTAC ATGTGTGGAG GATGTTGTTT ATCGAGGAGA CATGCTTCTG GAGAAGATAC 360 
AAAGCGCACT TGCTGATATT GCACAGTCAC AGCTGAAGAC AAGAAGTGGT ACCCATAGCC 420 
AGTCTCTTCC AGACTCATAG CATCAGTGGA TACCATGTGG CTGAGAAAAG AACTGTTTGA 480 
GTGCCATTAA GAATTCTGCA TCAGACTTAG ATACAAGCCT TACCAACAAT TACAGAAACA 540 
TTAAACACTA TGACACATTA CCTTTTTAGC TATTTTTAAT AGTCTTCTAT TTTCACTCTT 600 
GATAAGCTTA TAAATCATGA TTGAATCAGC TTTAAAGCAT CATACCATCA TTTTTTAACT 660 
GAGTGAAATT ATTAAGGCAT GTAATACATT AATGAACATA ATATAAGGAA ACATATGTAA 720 
AATTCTGTTA TGACATAATT TATGTCTCCA TTTTGTTGTA TTGGCCAGTA CTTTTACAAT 780 
C 



Seq ID NO: 371 Protein sequence 
Protein Accession #: NP_004255 

1 11 21 31 41 51 

MADRLTQLQD AVNSLADQFC NAIGVLQQCG PPASFNNIQT AINKDQPANP TEEYAQLFAA 60 
LIARTAKDID VLIDSLPSEE STAALQAASL YKLEEENHEA ATCVEDVVYR GDMLLEKIQS 120 
ALADIAQSQL KTRSGTHSQS LPDS 



Seq ID NO: 372 DNA sequence 
Nucleic Acid Accession #: AJ271091 
Coding sequence: 1-1113 

1 11 21 31 41 51 

I I I I I I 

ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60 
CTGCGCGTGG AGCTGAGTGA CGTACAGAAC CCTGCCATCA GCATCACTGA AAACGTGCTG 120 
CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180 
TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240 
ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300 
CTGTTTTTGG CTCCTGACTT TGATCGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 360 
AGAGCTAAGG AAGAAGAGCG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 420 
ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 480 
TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540 
TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 600 
GAAACTATCA ATGCAGCAAT TGGAGTCACT ACGTCACCGG TGCTGCCTTC TCTGATCCAG 660 
CTTCTTGGAA GAAATTTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 720 
AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 780 
TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840 
CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900 
ATTCCAATAT TCAATGAGAC CGGACGATTC AGTTTCACAT TGCCATATCC AGTGAAAATC 960 
AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 1020 
ATAAATTTTC GTCACCTTTA TAAACAGCGC AGACTGAAAA TGAGGGCAGG CGCAGTGGCT 1080 
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA 



Seq ID NO: 373 Protein sequence 
Protein Accession #: CAB69070 



1 11 21 31 41 51 

I I I I I I 

MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEFHLE 60 
FLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 120 
RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180 
YDTFHTVADM MYFCQMLAVV ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 240 
KAVVFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LAEAVSVIQS 300 
IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RLKMRAGAVA 360 
HACDPSALGG 

Seq ID NO: 374 DNA sequence 
Nucleic Acid Accession #: NM_016395 
Coding sequence: 1-1113 

1 11 21 31 41 51 

I I I I I I 

ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60 
CTGCGCGTGG AGCTGAGTGA CGTACAGAAC CCTGCCATCA GCATCACTGA AAACGTGCTG 120 
CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180 
TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240 
ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300 
CTGTTTTTGG CTCCTGACTT TGATCGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 360 
AGAGCTAAGG AAGAAGAGCG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 420 
ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 480 
TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540 
TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 600 
GAAACTATCA ATGCAGCAAT TGGAGTCACT ACGTCACCGG TGCTGCCTTC TCTGATCCAG 660 
CTTCTTGGAA GAAATTTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 720 
AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 780 
TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840 
CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900 
ATTCCAATAT TCAATGAGAC CGGACGATTC AGTTTCACAT TGCCATATCC AGTGAAAATC 960 
AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 1020 
ATAAATTTTC GTCACCTTTA TAAACAGCGC AGACTGAAAA TGAGGGCAGG CGCAGTGGCT 1080 
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA 

Seq ID NO: 375 Protein sequence 
Protein Accession #: NP_057479 

1 11 21 31 41 51 

I I I I I I 

MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEFHLE 60 
FLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 120 
RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180 
YDTFHTVADM MYFCQMLAVV ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 240 
KAVVFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LVEAVSVIQS 300 
IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RRRYGKKRKR 360 
STKKKDLDGF LPV 



Seq ID NO: 376 DNA sequence 
Nucleic Acid Accession #: NM_005987 
Coding sequence: 1-270 



1 11 21 31 41 51 

I I I I I I 

ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 60 

GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 120 

TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 180 

ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 240 
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA 

Seq ID NO: 377 Protein sequence 
Protein Accession #: NP_005978 

1 11 21 31 41 51 

I I I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 
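
The "Coding sequence" line of each DNA entry gives 1-based, inclusive nucleotide coordinates of the open reading frame whose translation is listed as the following protein entry. As a minimal sketch (not part of the listing itself), the relationship can be checked in Python, assuming the standard genetic code. The helper name translate_cds is hypothetical, and the input is the first 30 nucleotides of Seq ID NO: 376 with the spacing removed.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate dna[start..end] (1-based, inclusive), stopping at a stop codon.

    Codons containing characters other than A/C/G/T (for example N, or OCR
    damage) are reported as 'X' rather than guessed.
    """
    seq = dna.replace(" ", "").upper()
    cds = seq[start - 1:end]
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-mers of Seq ID NO: 376 (coding sequence starts at position 1).
fragment = "ATGAATTCTC AGCAGCAGAA GCAGCCTTGC"
print(translate_cds(fragment, 1, 30))
```

The printed residues can be compared with the first block of the corresponding protein entry (Seq ID NO: 377). Positions where the scanned text is damaged will show up as mismatches or as X.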

Seq ID NO: 378 DNA sequence 
Nucleic Acid Accession #: NM_002105 
Coding sequence: 74-505 

1 11 21 31 41 51 

I I I I I I 

ACAGCAGTTA CACTGCGGCG GGCGTCTGTT CTAGTGTTTG AGCCGTCGTG CTTCACCGGT 60 

CTACCTCGCT AGCATGTCGG GCCGCGGCAA GACTGGCGGC AAGGCCCGCG CCAAGGCCAA 120 

GTCGCGCTCG TCGCGCGCCG GCCTCCAGTT CCCAGTGGGC CGTGTACACC GGCTGCTGCG 180 

GAAGGGCCAC TACGCCGAGC GCGTTGGCGC CGGCGCGCCA GTGTACCTGG CGGCAGTGCT 240 

GGAGTACCTC ACCGCTGAGA TCCTGGAGCT GGCGGGCAAT GCGGCCCGCG ACAACAAGAA 300 

GACGCGAATC ATCCCCCGCC ACCTGCAGCT GGCCATCCGC AACGACGAGG AGCTCAACAA 360 

GCTGCTGGGC GGCGTGACGA TCGCCCAGGG AGGCGTCCTG CCCAACATCC AGGCCGTGCT 420 

GCTGCCCAAG AAGACCAGCG CCACCGTGGG GCCGAAGGCG CCCTCGGGCG GCAAGAAGGC 480 

CACCCAGGCC TCCCAGGAGT ACTAAGAGGG CCCGCGCCGC GGCCGGCCGC CCCAGCTCCC 540 

CATGCCACCA CAAAGGCCCT TTTAAGGGCC ACCACCGCCC TCATGGAAAG AGCTGAGCCG 600 

CTTCAGACTG CGGGGCAAGC GGGCCGCGGC TCCCTTCCCC TCCCCTCCCC TCGCCCGCCT 660 

TCGCCGCCCG GCCTCGAGTC CCCGCCCGCC CCCGCTCCCG TCCCGCACCG CCTGCCGCGT 720 

CGGCCTCGGG CCTGCCCTGT CCGCCGTCCG CCCTCCGGTA GGGTTCGGGC CTTCCGGATG 780 

CGGCTTGGGC GCTCTTCGGG GACCTCCGTG GCGCGGAAGA CCCGAGCCTG CCGGGGGGAG 840 
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GCCGGCGGCG CCGCACCTGC CCGCCTCGGC GTTCGTGACT CAGCCGCCCC ATCCCGAGTC 900 

GCTAAGGGGC TGCGGGGAGG CCX3CAGCACC TTCTGGAAGA CTTGGCCTTC CGCTCTGACG 960 

CAGGGCCGAG GTGGGCAGTC CAGGCCGAGA GCCGGCGGCG CTGAAGGTGA GTGAGGCCCT 1020 

CGGCAGCTGC AGCCGGGGTG TCTGGTACCC CCCCGGCGTG GTGCTTAGCC CAGGACTTTC 1080 

AGACGGCCGC TGGCCGGGAG GCTTTGGTGG GAGAGACGCG ATCGCCGATT TCGGTCTGGC 1140 

GCCCCTTCTG CGGCCGGGAC CCAGGCCTTT CACATCAGCT CTCCCTCCAT CTTCATTCAT 1200 

AGGTCTGCGC TGGGGCCGGG ACGAAGCACT TGGTAACAGG CACATCTTCC TCCCGAGTGA 1260 

CTGCCTCCTA GGAGGACATT TAGGGGAGGG CAGAGGCCTG CAGTTTGGCT TCACGGCTGG 1320 

CTATGTGGAC AGCAAGAGTC GTTTTGCGGA ACGCGACTGG CAGCCAGGCC TGTCGGGCCC 1380 

CCGACGCCGC CCCATTTCCC TTCCAGCAAA CTCAACTCGG CAATCCAAGC ACCTAGATAC 1440 

CAGCACAAGT CGGTTAATCC CTGTCTGGAC TGAGCCTCCG TTGGCTTCTG AACTGGAATT 1500 

CTGCAGCTAA CCCTTCCACG ACTAGAACCT TAGGCATTGG GGAGTTTTAG ATGGACTAAT 1560 
TTTATTAAAG GATTGTTTTT TTTTT 

Seq ID NO: 379 Protein sequence 
Protein Accession #: NP_002096 

1 11 21 31 41 51 

I I I I I I 

MSGRGKTGGK ARAKAKSRSS RAGLQFPVGR VHRLLRKGHY AERVGAGAPV YLAAVLEYLT 60 

AEILELAGNA ARDNKKTRII PRHLQLAIRN DEELNKLLGG VTIAQGGVLP NIQAVLLPKK 120 
TSATVGPKAP SGGKKATQAS QEY 

Seq ID NO: 380 DNA sequence 
Nucleic Acid Accession #: AL136942 
Coding sequence: 184-864 

1 11 21 31 41 51 

I I I I I I 

ACGCGTCCGG CAGAAGCTCG GAGCTCTCGG GGTATCGAGG AGGCAGGCCC GCGGGCGCAC 60 

GGGCGAGCGG GCCGGGAGCC GGAGCGGCGG AGGAGCCGGC AGCAGCGGCG CGGCGGGCTC 120 

CAGGCGAGGC GGTCGACGCT CCTGAAAACT TGCGCGCGCG CTCGCGCCAC TGCGCCCGGA 180 

GCGATGAAGA TGGTCGCGCC CTGGACGCGG TTCTACTCCA ACAGCTGCTG CTTGTGCTGC 240 

CATGTCCGCA CCGGCACCAT CCTGCTCGGC GTCTGGTATC TGATCATCAA TGCTGTGGTA 300 

CTGTTGATTT TATTGAGTGC CCTGGCTGAT CCGGATCAGT ATAACTTTTC AAGTTCTGAA 360 

CTGGGAGGTG ACTTTGAGTT CATGGATGAT GCCAACATGT GCATTGCCAT TGCGATTTCT 420 

CTTCTCATGA TCCTGATATG TGCTATGGCT ACTTACGGAG CGTACAAGCA ACGCGCAGCC 480 

TGGATCATCC CATTCTTCTG TTACCAGATC TTTGACTTTG CCCTGAACAT GTTGGTTGCA 540 

ATCACTGTGC TTATTTATCC AAACTCCATT CAGGAATACA TACGGCAACT GCCTCCTAAT 600 

TTTCCCTACA GAGATGATGT CATGTCAGTG AATCCTACCT GTTTGGTCCT TATTATTCTT 660 

CTGTTTATTA GCATTATCTT GACTTTTAAG GGTTACTTGA TTAGCTGTGT TTGGAACTGC 720 

TACCGATACA TCAATGGTAG GAACTCCTCT GATGTCCTGG TTTATGTTAC CAGCAATGAC 780 

ACTACGGTGC TGCTACCCCC GTATGATGAT GCCACTGTGA ATGGTGCTGC CAAGGAGCCA 840 

CCGCCACCTT ACGTGTCTGC CTAAGCCTTC AAGTGGGCGG AGCTGAGGGC AGCAGCTTGA 900 

CTTTGCAGAC ATCTGAGCAA TAGTTCTGTT ATTTCACTTT TGCCATGAGC CTCTCTGAGC 960 

TTGTTTGTTG CTGAAATGCT ACTTTTTAAA ATTTAGATGT TAGATTGAAA ACTGTAGTTT 1020 

TCAACATATG CTTTGCTAGA ACACTGTGAT AGATTAACTG TAGAATTCTT CCTGTACGAT 1080 

TGGGGATATA ACGGGCTTCA CTAACCTTCC CTAGGCATTG AAACTTCCCC CAAATCTGAT 1140 

GGACCTAG/A GTCTGCTTTT GTACCTGCTG GGCCCCAAAG TTGGGCATTT TTCTCTCTGT 1200 

TCCCTCTCTT TTGAAAATGT AAAATAAAAC CAAAAATAGA CAACTTTTTC TTCAGCCATT 1260 

CCAGCATAGA GAACAAAACC TTATGGAAAC AGGAATGTCA ATTGTGTAAT CATTGTTCTA 1320 

ATTAGGTAAA TAGAAGTCCT TATGTATGTG TTACAAGAAT TTCCCCCACA ACATCCTTTA 1380 

TGACTGAAGT TCAATGACAG TTTGTGTTTG GTGGTAAAGG ATTTTCTCCA TGGCCTGAAT 1440 

TAAGACCATT AGAAAGCACC AGGCCGTGGG AGCAGTGACC ATCTACTGAC TGTTCTTGTQ 1500 

GATCTTGTGT CCAGGGACAT GGGGTGACAT GCCTCGTATG TGTTAGAGGG TGGAATGGAT 1560 

GTGTTTGGCG CTGCATGGGA TCTGGTGCCC CTCTTCTCCT GGATTCACAT CCCCACCCAG 1620 

GGCCCGCTTT TACTAAGTGT TCTGCCCTAG ATTGGTTCAA GGAGGTCATC CAACTGACTT 1680 

TATCAAGTGG AATTGGGATA TATTTGATAT ACTTCTGCCT AACAACATGG AAAAGGGTTT 1740 

TCTTTTCCCT GCAAGCTACA TCCTACTGCT TTGAACTTCC AAGTATGTCT AGTCACCTTT 1800 

TAAAATGTAA ACATTTTCAG AAAAATGAGG ATTGCCTTCC TTGTATGCGC TTTTTACCTT 1860 

GACTACCTGA ATTGCAAGGG ATTTTTATAT ATTCATATGT TACAAAGTCA GCAACTCTCC 1920 

TGTTGGTTCA TTATTGAATG TGCTGTAT^T TAAGTCGTTT GCAATTAAAA CAAGGTTTGC 1980 
CCACATCCAA AAAAAAAAAA AAAAA 

Seq ID NO: 381 Protein sequence 
Protein Accession #: CAB66876 

1 11 21 31 41 51 

I I I I I I 

MKMVAPWTRF YSNSCCLCCH VRTGTILLGV WYLIINAVVL LILLSALADP DQYNFSSSEL 60 

GGDFEFMDDA NMCIAIAISL LMILICAMAT YGAYKQRAAW IIPFFCYQIF DFALNMLVAI 120 

TVLIYPNSIQ EYIRQLPPNF PYRDDVMSVN PTCLVLIILL FISIILTFKG YLISCVWNCY 180 
RYINGRNSSD VLVYVTSNDT TVLLPPYDDA TVNGAAKEPP PPYVSA 

Seq ID NO: 382 DNA sequence 
Nucleic Acid Accession #: NM_002510 
Coding sequence: 92-1774 

1 11 21 31 41 51 

I I I I I 1 

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60 

CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120 

TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180 

CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA 300 

AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360 

GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 1080 

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260 

GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA GCATTCCCAC 1320 

GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 1380 

CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 1440 

GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 1500 

GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 1560 

CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 1620 

CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 1680 

TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 1740 

ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 1800 

TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 1860 

TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGTCAT 1920 

TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA 1980 

AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG 2040 

TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC TATTGATTAG 2100 

AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC ATGTAAGTGT 2160 

ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC TGAATCCCAC 2220 

TTCAATACAC ACTCATGTWVC TCCTGATGGA ACAATAACAG GCCCAAGCCT GTGGTATGAT 2280 

GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG AGTATTTTGG 2340 

TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA TTTATTCCAT 2400 

GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC TCTTGTGTAT 2460 

ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA GACTTTCCAA 2520 

AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC TCTGTTTGTA 2580 

ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC CTGAAAAATA 2640 
AAGTGTGGGA AGAGACAAAA A/^AAAAAAA 

Seq ID NO: 383 Protein sequence 
Protein Accession #: NP_002501 

1 11 21 31 41 51 

I I I I I I 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60 

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY VVTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 360 

RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF VVTCQGSIPT EVCTIISDPT 420 

CEITQNTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNVVRSKGLS VFLNRAKAVF 540 
FPGNQEKDPL DKNQEFKGVS 

Seq ID NO: 384 DNA sequence 
Nucleic Acid Accession #: NM_001134 
Coding sequence: 48-1877 

1 11 21 31 41 51 

I I I I I I 

TCCATATTGT GCTTCCACCA CTGCCAATAA CT^AAATTUVCT AGCAACCATG AAGTGGGTGG 60 

AATCAATTTT TTTAATTTTC CTACTAAATT TTACTGAATC CAGAACACTG CATAGAAATG 120 

AATATGGAAT AGCTTCCATA TTGGATTCTT ACCAATGTAC TGCAGAGATA AGTTTAGCTG 180 

ACCTGGCTAC CATATTTTTT GCCCAGTTTG TTCAAGAAGC CACTTACAAG GAAGTAAGCA 240 

AAATGGTGAA AGATGCATTG ACTGCAATTG AGAAACCCAC TGGAGATGAA CAGTCTTCAG 300 

GGTGTTTAGA AAACCAGCTA CCTGCCTTTC TGGAAGAACT TTGCCATGAG AAAGAAATTT 360 

TGGAGAAGTA CGGACATTCA GACTGCTGCA GCCAAAGTGA AGAGGGAAGA CATAACTGTT 420 

TTCTTGCACA CAAAAAGCCC ACTCCAGCAT CGATCCCACT TTTCCAAGTT CCAGAACCTG 480 

TCACAAGCTG TGAAGCATAT GAAGAAGACA GGGAGACATT CATGAACAAA TTCATTTATG 540 

AGATAGCAAG AAGGCATCCC TTCCTGTATG CACCTACAAT TCTTCTTTGG GCTGCTCGCT 600 

ATGACAAAAT AATTCCATCT TGCTGCAAAG CTGAAAATGC AGTTGAATGC TTCCAAACAA 660 

AGGCAGCAAC AGTTACAAAA GAATTAAGAG AAAGCAGCTT GTTAAATCAA CATGCATGTG 720 

CAGTAATGAA AAATTTTGGG ACCCGAACTT TCCAAGCCAT AACTGTTACT AAACTGAGTC 780 

AGAAGTTTAC CAAAGTTAAT TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 840 

TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GAAAAAATCA 900 

TGTCCTACAT ATGTTCTCAA CAAGACACTC TGTCAAACAA AATAACAGAA TGCTGCAAAG 960 

TGACCACGCT GGAACGTGGT CAATGTATAA TTCATGCAGA AAATGATGAA AAACCTGAAG 1020 

GTCTATCTCC AAATCTAAAC AGGTTTTTAG GAGATAGAGA TTTTAACCAA TTTTCTTCAG 1080 

GGGAAAAAAA TATCTTCTTG GCAAGTTTTG TTCATGAATA TTCAAGAAGA CATCCTCAGC 1140 

TTGCTGTCTC AGTAATTCTA AGAGTTGCTA AAGGATACCA GGAGTTATTG GAGAAGTGTT 1200 

TCCAGACTGA AAACCCTCTT GAATGCCAAG ATAAAGGAGA AGAAGAATTA CAGAAATACA 1260 

TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 1320 

ATTACTTACA AAATGCGTTT CTCGTTGCTT ACACAAAGAA AGCCCCCCAG CTGACCTCGT 1380 

CGGAGCTGAT GGCCATCACC AGAAAAATGG CAGCCACAGC AGCCACTTGT TGCCAACTCA 1440 

GTGAGGACAA ACTATTGGCC TGTGGCGAGG GAGCGGCTGA CATTATTATC GGACACTTAT 1500 

GTATCAGACA TGAAATGACT CCAGTAAACC CTGGTGTTGG CCAGTGCTGC ACTTCTTCAT 1560 

ATGCCAACAG GAGGCCATGC TTCAGCAGCT TGGTGGTGGA TGAAACATAT GTCCCTCCTG 1620 

CATTCTCTGA TGACAAGTTC ATTTTCCATA AGGATCTGTG CCAAGCTCAG GGTGTAGCGC 1680 

TGCAAACGAT GAAGCAAGAG TTTCTCATTA ACCTTGTGAA GCAAAAGCCA CAAATAACAG 1740 

AGGAACAACT XGAGGCTGTC ATTGCAGATT TCTCAGGCCT GTTGGAGA?^ TGCTGCCAAG 1800 

GCCAGGAACA GGAAGTCTGC TTTGCTGAAG AGGGACAAAA ACTGATTTCA AAAACTCGTG 1860 

CTGCTTTGGG AGTTTAAATT ACTTCAGGGG AAGAGAAGAC AAAACGAGTC TTTCATTCGG 1920 

TGTGAACTTT TCTCTTTAAT TTTAACTGAT TTAACACTTT TTGTGAATTA ATGAAATGAT 1980 
AAAGACTTTT ATGTGAGATT TCCTTATCAC AGAAATT^AAA TATCTCCAAA TG 

Seq ID NO: 385 Protein sequence 
Protein Accession #: NP_001125 

1 11 21 31 41 51 

I I I I I I 

MKWVESIFLI FLLNFTESRT LHRNEYGIAS ILDSYQCTAE ISLADLATIF FAQFVQEATY 60 

KEVSKMVKDA LTAIEKPTGD EQSSGCLENQ LPAFLEELCH EKEILEKYGH SDCCSQSEEG 120 

RHNCFLAHKK PTPASIPLFQ VPEPVTSCEA YEEDRETFMN KFIYEIARRH PFLYAPTILL 180 

WAARYDKIIP SCCKAENAVE CFQTKAATVT KELRESSLLN QHACAVMKNF GTRTFQAITV 240 

TKLSQKFTKV NFTEIQKLVL DVAHVHEHCC RGDVLDCLQD GEKIMSYICS QQDTLSNKIT 300 

ECCKLTTLER GQCIIHAEND EKPEGLSPNL NRFLGDRDFN QFSSGEKNIF LASFVHEYSR 360 

RHPQLAVSVI LRVAKGYQEL LEKCFQTENP LECQDKGEEE LQKYIQESQA LAKRSCGLFQ 420 

KLGEYYLQNA FLVAYTKKAP QLTSSELMAI TRKMAATAAT CCQLSEDKLL ACGEGAADII 480 

IGHLCIRHEM TPVNPGVGQC CTSSYANRRP CFSSLVVDET YVPPAFSDDK FIFHKDLCQA 540 

QGVALQTMKQ EFLINLVKQK PQITEEQLEA VIADFSGLLE KCCQGQEQEV CFAEEGQKLI 600 
SKTRAALGV 

Seq ID NO: 386 DNA sequence 

Nucleic Acid Accession #: NM_002205.1 

Coding sequence: 1..3149 

1 11 21 31 41 51 

I I I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

GAGGTGGGCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 






Seq ID NO: 387 Protein sequence 
Protein Accession #: NP_002196.1 

1 11 21 31 41 51 

I I I I I I 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGVVFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

VDKAVVYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDVV 780 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

Seq ID NO: 388 DNA sequence 
Nucleic Acid Accession #: NM_002425 
Coding sequence: 26..1453 

1 11 21 31 41 51 

I I I I I I 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 

Seq ID NO: 389 Protein sequence 
Protein Accession #: NP_002416 

1 11 21 31 41 51 

I I I I I I 

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 

KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120 

PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 

HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 

FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 

IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 

Seq ID NO: 390 DNA sequence 
Nucleic Acid Accession #: NM_002421.2 
Coding sequence: 1..1409 

1 11 21 31 41 51 

I I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 



GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 391 Protein sequence 
Protein Accession #: NP_002412.1 

1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 




Seq ID NO: 392 DNA sequence 
Nucleic Acid Accession #: NM_002421.2 
Coding sequence: 1..1409 

1 11 21 31 41 51 

I I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 393 Protein sequence 
Protein Accession #: NP_002412.1 

1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 240 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Seq ID NO: 394 DNA sequence 
Nucleic Acid Accession #: NM_014331.2 
Coding sequence: 1..1506 

1 11 21 31 41 51 

I I I I I I 

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG GTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGT^GACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTT']:AC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGi^A ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 

Seq ID NO: 395 Protein sequence 
Protein Accession #: NP_055146.1 

1 11 21 31 41 51 

I I I I I I 

MVRKPVVSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 

GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITVVMVLN 180 

SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240 

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS 300 

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEVVPEEDK L 



Seq ID NO: 396 DNA sequence 

Nucleic Acid Accession #: NM_006528 

Coding sequence: 57..764 

1 11 21 31 41 51 

I I I I I I 

GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60 

ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 
AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 
GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 
ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 
GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 
TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 
TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 
AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 
AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 
CC 

Seq ID NO: 397 Protein sequence 
Protein Accession #: NP_006519 

1 11 21 31 41 51 

I I I I I I 

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 
CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 
TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF

Seq ID NO: 398 DNA sequence

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1..1361 

1 11 21 31 41 51 

| | | | | |

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 
TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 
AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC
TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 
GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 
TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 
GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 
GTGAACGTGC CCAGCCAGCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 
CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 
CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 
ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT GGCCGGGGGC 
ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 
ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 
ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 
GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC
CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 
TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 
ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 
TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 
TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTTT GA 

Seq ID NO: 399 Protein sequence
Protein Accession #: NP_001499.1

1 11 21 31 41 51

| | | | | |

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSV TIRVTQVLQK
KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY
ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL
VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFVV YLVVLLSVAF
MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIVV TLAVCWMPNQ
IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC
CRLSLQHRNH EKRMIVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV

Seq ID NO: 400 DNA sequence 

Nucleic Acid Accession #: NM_006475.1 

Coding sequence: 28..2538

1 11 21 31 41 51

| | | | | |
AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 
TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 
AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 
TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA
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GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 

GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140

CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200

GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260

GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320

CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 

ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 

TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 

ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560

TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620

ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 

CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 

TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 

AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860

TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920

GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980

CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040

AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 

ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 

AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220

AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280

GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340

ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 

GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 

ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520

GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 

ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700

GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760

CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820

AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880

CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940

TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000

TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060

CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120

TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 
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40 



Seq ID NO: 401 Protein sequence
Protein Accession #: NP_006466.1

1 11 21 31 41 51

| | | | | |

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180
NGMIIPSMYN NLGLFINHYP NGVVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF 240
IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300
MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660
FKEIPVTVYT TKIITKVVEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRLIKEGETI 720
TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL



Seq ID NO: 402 DNA sequence 
Nucleic Acid Accession #: NM_002416
Coding sequence: 40..417

1 11 21 31 41 51

| | | | | |

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTA TGAAGAAAAG TGGTGTTCTT 60
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120
AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180
GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300
AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360
AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420
ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480
TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540
ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600
TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660
TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720
CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780
TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900
CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080
AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1140
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380
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AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 

TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620

GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1680

GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 

CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 

TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 

AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980

TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040

TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100

GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160

CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220

TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280

AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 

GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 

TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 

AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520

TATCAATAAA TAGACCATTA ATCAG



Seq ID NO: 403 Protein sequence 
Protein Accession #: NP_002407 

1 11 21 31 41 51 

| | | | | |

MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR
QKKTT 

Seq ID NO: 404 DNA sequence 

Nucleic Acid Accession #: NM_006670 

Coding sequence: 85..1347

1 11 21 31 41 51

| | | | | |

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60
AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120
GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180
TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240
TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300
CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360
GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420
CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480
GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540
CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600
AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660
CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720
CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780
CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840
TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900
CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960
AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020
GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080
GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140
CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200
GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260
AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320
AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380
CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440
TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500
TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560
TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620
TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680
ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740
TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800
CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860
AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920
TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGATAATGT ACTGATTTTT 1980
TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040
ATTCTTAAAA GAA

Seq ID NO: 405 Protein sequence
Protein Accession #: NP_006661

1 11 21 31 41 51

| | | | | |

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA FARRPPLAEL 120
AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPF AFSGSNASVS APSPLVELIL 180
NHIVPPEDER QNRSFEGMVV AALLAGRALQ GLRRLELASN HFLYLPRDVL AQLPSLRHLD 240
LSNNSLVSLT YVSFRNLTHL ESLHLEDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300
HMADMVTWLK ETEVVQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360
IVLALIGAIF LLVLYLNRKG IKKWMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 420



Seq ID NO: 406 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..927

1 11 21 31 41 51

| | | | | |

ATGGCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60

CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180 

CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240

AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300

AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360

AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 

CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600

GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660

TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720

TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 

CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 

AGTTCTAACT CGGATGTCCT CGAGTGA

Seq ID NO: 407 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120

SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180

PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240

SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300
SSNSDVLE 

Seq ID NO: 408 DNA sequence 
Nucleic Acid Accession #: NM_000095.1
Coding sequence: 26..2299

1 11 21 31 41 51

| | | | | |

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCG GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCGTGCG ACGACGACGA 1440

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACAGCATC CCAGAGGACT ATGAGACGCA 2280

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCG GGATGACAGG CACCCTCACC 2340

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400

AAGGGCTGAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG

Seq ID NO: 409 Protein sequence 
Protein Accession #: NP_000086.1

1 11 21 31 41 51

| | | | | |

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60




FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: 410 DNA sequence 

Nucleic Acid Accession #: NM_001565.1

Coding sequence: 67..363

1 11 21 31 41 51

| | | | | |

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 
ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 
ACCTTTCGCA TCTTCCAAGG GTACTARQGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG

Seq ID NO: 411 Protein sequence
Protein Accession #: NP_001556.1

1 11 21 31 41 51

| | | | | |

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 412 DNA sequence

Nucleic Acid Accession #: XM_057014

Coding sequence: 143..874

1 11 21 31 41 51

| | | | | |

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA
CAACCTTAAA AAAAAAAAAA AAAA

Seq ID NO: 413 Protein sequence 
Protein Accession #: XP_057014 

1 11 21 31 41 51 

| | | | | |
MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 
LPK 



Seq ID NO: 414 DNA sequence

Nucleic Acid Accession #: XM_084007

Coding sequence: 138..2405



CTCGTGCCGA 
CCAGTGGGCC 

GCGGAGACGA 
TCTCTGTCAC 
AAATTAGTCC 
ATCATCTACA 
TCAGAAAATT 
ACCACGACCA 
AGCATCACTC 
CTGCTTCTGG 
GTAAAGATCC 
GAAGGAATGT 
TCTCTGAAGG 
CCAAAGATGT 
TGGCTGGTAG 
GAAACACAAA 
GCATGGGCAT 
TCAACCAAAT 
CTCCAAAGAC 
TCAGTTTCCT 
AATTTCTCCT 
TACACCTTCT- 
CAATGGAAAT 
GTGCCTATTT 
TTCTTGTTGA 
AGAAGATUVCC 
CTCAACTTTC 
GAGGAGACTC 
AAGAGGTCAT 
GGTGCAAGAA 
TTCACCACCA 
CTCACAGTCA 
TGGCCTGGAT 
GTGCTGCTTT 
ATGAGTTGCC 
AGCAGGCTGT 
GAATTTTCAT 
GCTTATTCAT 
GTGACCATGG 
GTTTTGGAAT 
TCTAGTTAAG 
AGGGAGATGA 
TTGTATTGAA 
TATTCTATCT 
TAAACAAGAG 
TTTTCAAGAA 
TGTTTAGGAA 
AGCAAAGAAA 
AAAAATCACA 
CAGAATTAGT 
GTAGTGAGCA 
AAATATATTT 
TTCGTGCGGG 
TATTGCCAAG 
CAATU^TTATC 
TCATTTGATT 
GAGCAATTGT 
GATGTTTCTT 



11 

I 

ATTCGGCACG 
CGTGTGGAAC 

AGGCGCAATG 
AAATCCCCTT 
GAATTGGGAA 
ACAGCTTTTC 
ACTTCAAAAT 
TCACTCAGAC 
AGACCACGAG 
TAAAAATAAG 
TAGAAACAGC 
CAAGGACAGT 
AACTCACTTT 
AAGCAGCTCC 
GAAAACAAAT 
TGAAAATCCT 
CCAGGTTCCG 
TGATGCTAGA 
CTATTCATTA 
GTCTCTGCTG 
GAGTTTCCTT 
TCCACATTCT 
GAAAAGAGGA 
TGATTCCACG 
ACATGTCCTC 
TGAAAATGAT 
AACAAATGAG 
ACAAGAGCCC 
GATAGCTCAT 
TAAATGCCAT 
TCATGACTAC 
CAGCCAGCGC 
GGTGATAATG 
TACTGAAGGC 
TCATGAATTA 
CCTTTATAAT 
TGGTCATTAT 
GTATGTTGCT 
ATGTAGCCqC 
TATGTTACTT 
GTTTAAATGC 
GTTTGTATGC 
TATTGCTGTC 
TGGAGATAAA 
ATTTGGCATG 
CTAACACAGT 
TAAGAATGTG 
TAAAGGAGAA 
AAATTTGTTG 
ATAGAGTACA 
CTCTCATATA 
AATGAATTCA 
TTATATACCA 
TTATATATCA 
AGAGTAGTAA 
CGATTCAGAA 
CTTTATATAC 
TTTTACACAA 



21 
1 

AGACCGCGTG 
CTUW^CCTGCG 
GCGAGGAAGT 
CATGAACTAA 
TCTGGCATTA 
TACCGCTATG 
ATAGGCATAG 
CACGAGCATC 
CATCACTCTG 
CGAAAAGCTC 
CAGGGGAAAG 
GTTAGTGCTA 
CTA6AGACAA 
ACTCCACCCA 
GAATCTGTGA 
CAGGAGTGTT 
CTGAATGCAA 
TCTTGTCTGA 
CAAATAGCCT 
GGGGTTATCT 
GTGGCACTGG 
CATGCAAGTC 
CCACTTTTCA 
TGGAAGGGTC 
ACATTGATCA 
GATGATGTGG 
GAGAAAGTAG 
TCCCACTTTG 
GCTCATCCAC 
TCACATTTCC 
CATCATATTC 
TACTCTCGGG 
GGTGATGGCC 
TTATCAAGTG 
GGTGACTTTG 
GCATTGTCAG 
6CTGAAAATG 
CTGGTTGATA 
TGGGGGTATT 
ATTTCCATAT 
TAGAGTAGCT 
TGTACTATGC 
TGTTACAAAG 
ATCTGTATGT 
ACATGTTCTG 
TATTCCTATA 
CATGAAGCCT 
AAGAGAAGAA 
TAAATTAGAG 
TTCATTAAAC 
CTAATTAGTG 
AGCAATATAC 
GATGAGTACA 
CCAAAAGCTG 
AACTTTGATA 
AGTACTTTGA 
GGTACTGTAG 
TAAATTCCTT 



31 
I 

TTC6CGCCTG 
CGCGTGGCCG 
TATCTGTAAT 
AAGCAGCTGC 
ATGTTGACTT 
GAGAAAATAA 
ATAAGATTAA 
ACTCAGACCA 
ACCATGATCA 
TTTGCCCAGA 
GAGCTCACCG 
GTGAAGTGAC 
TA6AGACTCC 
GTGTCACATC 
GTGAGCCCCG 
TCAATGCATC 
CAGAGTTCAA 
TTCATACAAG 
GGGTTGGTGG 
TAGTGCCTCT 
CCGTTGGGAC 
ACCACCATAG 
GTCATCTGTC 
TAACAGCTCT 
AACAATTTAA 
AGATTAAGAA 
ATACAGATGA 
ATTCTCAGCA 
AGGAAGTCTA 
ACGATACACT 
TCCATCATCA 
AGGAGCTGAA 
TGCACAATTT 
GTTTAAGTAC 
CTGTTCTACT 
CCATGCTGGC 
TTTCTATGTG 
TGGTACCTGA 
TCTTTTTACA 
TTGAACATAA 
TAAAAAGTTG 
AGCGTTTAAA 
TCAGTTAAAG 
GCAATTCACC 
TATGTTTCAG 
CTGGATTTTA 
AAAATACCAA 
TCTGAGAATT 
GGGAGAAATT 
ATTTTTGTCA 
TACATTTAAC 
ACTTGACCAA 
GTGAGTAGTT 
TATGACTGGA 
TATATGAGGA 
TATCTCTCAG 
CCATACTAGG 
ATATCAGCTT 



41 

1 

GTAGAGATTT 
GGCCGTGGGA 
CTTGATCCTG 
TTTCCCCCAG 
GGCAATTTCC 
TTCTTTGTCA 
AAGAATCCAT 
TGAGCGTCAC 
TCACTCCCAC 
CCATGACTCA 
ACCAGAACAT 
CTCAACTGTG 
AAGACCTGGA 
AAAGAGCCGG 
AAAAGGCTTT 
AAAGCTACTG 
CTATCTCTGT 
TGAAAAGAAG 
TTTTATAGCC 
CATGAATCGG 
TTTGAGTGGT 
TCATAGCCAT 
TTCTCAAAAC 
AGGAGGCCTG 
AGATAAGAAG 
GCAGTTGTCC 
TCGAACTGAA 
GCCTGCAGTC 
CAATGAATAT 
CGGCCAGTCA 
CCACCACCAA 
AGATGCCGGC 
CAGCGATGGC 
TTCTGTTGCT 
AAAGGCTGGC 
GTATCTTGGA 
GATATTTGCA 
AATGCTGCAC 
GAATGCTGGG 
AATCGTGTTT 
TCATAGTTTC 
GTTAGTGGGT 
GTACGTTTTA 
GGTATTACCA 
GGAAAAATGT 
GGTCTCTGAA 
GAAAGCTTAT 
GGGGAGGCAT 
TAGAATTAAG 
GGATTATTTC 
TTTGTATAAT 
GAAATTGGAA 
TATGTATCAC 
TGTTCTGGTT 
TATTAAAACT 
TGCTTCAGTG 



51 
1 

CTCGAAGACA 
CAACGAGGCC 
ACCTTTGCCC 
ACCACTGAGA 
ACACGGCAAT 
GTTGAAGGGT 
ATACACCATG 
TCAGACCATG 
CATAATCATG 
GATAGTTCAG 
GCCAGTGGTA 
TACAACACTG 
AAACTCTTCC 
GTGAGCCGGC 
ATGTATTCCA 
ACATCTCATG 
CCAGCCATCA 
GCTGAAATCC 
ATTTCCATCA 
GTGTTTTTCA 
GATGCTTTTT 
GAAGAACCAG 
ATAGAAGAAA 
TATTTCATGT 
AAAAAGAATC 
AAGTATGAAT 
GGCTATTTAC 
TTGGAAGAAG 
GTACCCAGAG 
GACGATCTCA 
AACCACCATC 
GTCGCCACTT 
CTAGCAATTG 
GTGTTCTGTC 
ATGACCGTTA 
ATGGCAACAG 
CTTACTGCTG 
AATGATGCTA 
ATGCTTTTGG 
CGTATAAATT 
AGTAGGTCAT 
TTTGTGATTT 
ATATTTAAGT 
GTTTATTATG 
CTTTAATGCT 
GAACTGCTGG 
ACTGAATTTA 
AGATTCTTAT 
TATAAAAAGG 
CCGTAAAAAC 
ACAGAAATCT 
TTTCAAAATG 
CAGACTGGGT 
ACCTGGTTTA 
ACACTAAGTA 
CTATCATTGT 
GCATTCTCTA 
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MARKLSVIIiI 
FYRYGENNSIi 
EHHSDHDHHS 
SVSASEVTST 
NESVSEPRKG 
RSCLIHTSBK 
LVALAVGTLS 
TWKGLTALGG 
EEKVDTDDRT 
HSHFHDTLGQ 
MGDGLHNFSD 
NALSAMLAYL 
RWGYFFLQNA 



11 
I 

LTFALSVTNP 
SVEGFRKIiLQ 
HHNHAASGKN 
VYNTVSEGTH 
FMYSRNTNEN 
KAEIPPKTYS 
GDAFLHIiLPH 
LYFMFLVEHV 
EGYLRADSQE 
SDDLIHHHHD 
GLAIGAAFTE 
GMATGIFIGH 
GMLLGFGIML 



21 
I 

LHELKAAAFP 
NIGIDKXKRI 
KRKALCPDHD 
FLETIETPRP 
PQECFNASKL 
LQIAWVGGFI 
SHASHHHSHS 
LTLIKQFKDK 
PSHFDSQQPA 
YHHILHHHHH 
GLSSGLSTSV 
YAENVSMWIF 
LISIFEHKIV 



31 
1 

QTTEKISPNW 
HIHHDHDHHS 
SDSSGKDPRN 
GKLPPKDVSS 
LTSHGMGIQV 
AISIISFIiSL 
HEEPAMEMKR 
KKKNQKKPEN 
VLEEEEVMIA 
QNHHPHSHSQ 
AVFCHELPHE 
ALTAGLFMYV 
FRINF 



ESGINVDLAI 



SQGKGAHRPE 
STPPSVTSKS 
PLNATEFNYL 
LGVILVPLMN 
GPLFSKLSSQ 
DDDVEIKKQL 
HAHPQEVYNE 
RYSREELKDA 
LGDFAVLLKA 
ALVDMVPEML 



51 
1 

STRQYHLQQL 
HSDHBHHSDH 
HASGRRNVKD 
RVSRLAGRKT 
CPAIINQIDA 
RVFFKFLLSF 
NIEESAYPDS 
SKYESQLSTN 
YVPRGCKNKC 
GVATLAWMVI 
GMTVKQAVIjY 
HNDASDHGCS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
€60 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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I I I I I I 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60 
CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 
TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 
AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 
TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360
CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 
TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 
AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540 
AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 
TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720
TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780
AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 
GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 1080

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 12 00 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1800 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980

CATTTTACGG TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GAGTCCAGTA CATCACAGTT ATTTGAGGAT 3060 

TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG ATU^CTATACT TTTGCCTAGA 3840

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100

GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400 

ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 

CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 

AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 

CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 

GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180

CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240

TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360

GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 

GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540

AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840

GACGGGAGTC TGGTGAACTC CTTCATGCAG TCXK5ATGACA GCGGTGGACG CACCAAGCGC 6900

TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 

AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380

CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440

CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500

GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580

GTCACAGTGC ATGGTGGGCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGA;\ACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600
ATAGTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA

Seq ID NO: 417 Protein sequence 
Protein Accession #: NP_056234.1
1 11 21 31 41 51

| | | | | |

MPKRAHWGAL SVVLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60

NLGFNSIQAL SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 120

QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180

RLSTIRHLYL AENMVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 240
CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 300

DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNQT DPPDIDINAT 360

VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 420

VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 480 

PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 540

RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 600

ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660 

HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 720

EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEVVDD LSEKTEEITS 900

TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 1020 

STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 1080

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISSQVESSL 1200

VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 1260 

PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320

SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 1380 

NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KEDEDVDFTS EFLSSLTVST PFHQEEAGSS 1440

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 1500

MSLGQTTTTK PALPSPRISQ ASRDSKEMVF LNYVGNPETE ATPVNNEGTQ HMSGPNELST 1560

PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA KPILPTATVR 1620 

LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 1680 

PSIPSKFTDR RTDQFNGYSK VFGNNNIPEA RNPVGKPPSP RIPHYSNGRL PFFTNKTLSF 1740 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 1920

QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECLAKGTPAP 1980 

QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 2040

HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 2100

VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLVVKS VTDKDAGDYL 2220

CVARNKVGDD YVVLKVDVVM KPAKIEHKEE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 2280

DGSLVNSFMQ SDDSGGRTKR YVVFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMRVRVK 2340

VVTAPATIRN KTYLAVQVPY GDVVTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400

GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 2460 

RKLIDCKAEG IPTPRVLWAF PEGVVLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 2520

RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 2580 

GTDLQSGQQL QRFYHKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 2640

KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 2700 

ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 2760 

PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 2820 
KTTYIHVF 

Seq ID NO: 418 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..5001

1 11 21 31 41 51

| | | | | |

ATGCCAGGCA C^J^AACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAT^TGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 

TTTCAGAATA CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780

ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAT^T ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320

GTTGCTCCOG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 
GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280

TCCTCCAGGC RGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCX5GGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GTACCATTGT AGATCTGGAA 3780

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3900

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTAGT 3960

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAG 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160

GAAGT^CAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 

CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640 

AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700

TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760

TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 

ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5880

TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940

TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AATW^AAAAA 6000 
AAAAAAA

Seq ID NO: 419 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51

| | | | | |

MPGTKLTRTG APADYRVILK TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKVVA 60

SRQYTVRYRE KGELARWDYK QIANRRVLIE NLIPDTVYEF AVRISQGERD GKWSTSVFQR 120

TPESAPTTAP ENLNVWPVNG KPTVVAASWD ALPETEGKVK VCLLDTGLFS VSSFQPSAKS 180

FQNTFFHTPR LSNHLEQSPS PILETLLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 240

SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF MLDIGGFSFI 300

MCYEDPVSSL TGNSLKSVAA SKADVQQNTE DNGKPEKPEP SSPSPRAPAS SQHPSVPASP 360

QGRNAKDLLL DLKNKILANG GAPRKPQLRA KKAEELDLQS TEITGEEELG SREDSPMSPS 420 

DTQDQKRTLR PPSRHGHSVV APGRTAVRAR MPALPRREGV DKPGFSLATQ PRPGAPPSAS 480

ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDERAVGSL HPKGAFAQPR PALSPSRQSP 540

SSVLRDRSSV HPGAKPASPA RRTPHSGAAE EDSSASAPPS RLSPPHGGSS RLLPTQPHLS 600 

SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHLSSRTQV SEGAEASDGE SHGDGDREDG 660 

GRQAEATAQT LRARPASGHF HLLRHKPFAA NGRSPSRFSI GRGPRLQPSS SPQSTVPSRA 720 

HPRVPSHSDS HPKLSSGIHG DEEDEKPLPA TVVNDHVPSS SRQPISRGWE DLRRSPQRGA 780

SLHRKEPIPE NPKSTGADTH PQGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS 840

PARPPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSRDAGRS PSQPRLSLTQ 900 

AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDDSTEVE AQDVRAPAHA 960 

ARAKEAAASL PKHQQVESPT GAGAGGDHRS QRGHAASPAR PSRPGGPQSR ARVPSRAAPG 1020

KSEPPSKRPL SSKSQQSVSA EDEEEEDAGF FKGGKKDLLS SSVPKWPSSS TPRGGKDADG 1080

SLAKEEREPA IALAPRGGSL APVKRPLPPP PGSSPRASHV PSRPPPRSAA TVSPVAGTHP 1140

WPRYTTRAPP GHFSTTPMLS LRQRMMHARF RNPLSRQPAR PSYRQGYNGR PNVEGKVLPG 1200

SNGKPNGQRI INGPQGTKWV VDLDRGLVLN AEGRYLQDSH GNPLRIKLGG DGRTIVDLEG 1260

TPVVSPDGLP LFGQGRHGTP LANAQDKPIL SLGGKPLVGL EVIKKTTHPP TTTMQPTTTT 1320

TPLPTTTTPR PTTATTMQPT TTTTPLPTTT PRPTTATTRR TTTRRPTTTV RTTTRTTTTT 1380

TPKPTTPIPT CPPGTLERHD DDGNLIMSSN GIPECYAEED EFSGLETDTA VPTEEAYVIY 1440

DEDYEFETSR PPTTTEPSTT ATTPRVIPEE GAISSFPEEE FDLAGRKRFV APYVTYLNKD 1500

PSAPCSLTDA LDHFQVDSLD EIIPNDLKKS DLPPQHAPRN ITVVAVEGCH SFVIVDWDKA 1560

TPGDLVTGYL VYSASYEDFI RNKFSTQASS VTHLPIENLK PNTRYYFKVQ AQNPHGYGPI 1620

SPSVSFVTES DNPLLVVRPP GGELSGSHSL SNMIPATRTA MDGNM

Seq ID NO: 420 DNA sequence
Nucleic Acid Accession #: NM_022743
Coding sequence: 128..1237

1 11 21 31 41 51
| | | | | |
GTGGATTTTA GAGATACCTC CCCTCCTTCT GCTCAGCTGC CTTGCAGTAA TTAAACTCTT 60
TCTCTGCTGC AACACCCCTA CTGTTCTCCG TGTATTGGCT TTTCTGGGCA GCAGGAAGGA 120
AAAGCTGATG CGATGCTCTC AGTGCCGCGT CGCCAAATAC TGTAGTGCTA AGTGTCAGAA 180
AAAAGCTTGG CCAGACCACA AGCGGGAATG CAAATGCCTT AAAAGCTGCA AACCCAGATA 240
TCCTCCAGAC TCCGTTCGAC TTCTTGGCAG AGTTGTCTTC AAACTTATGG ATGGAGCACC 300
TTCAGAATCA GAGAAGCTTT ACTCATTTTA TGATCTGGAG TCAAATATTA ACAAACTGAC 360
TGAAGATAAG AAAGAGGGCC TCAGGCAACT CGTAATGACA TTTCAACATT TCATGAGAGA 420
AGAAATACAG GATGCCTCTC AGCTGCCACC TGCCTTTGAC CTTTTTGAAG CCTTTGCAAA 480
AGTGATCTGC AACTCTTTCA CCATCTGTAA TGCGGAGATG CAGGAAGTTG GTGTTGGCCT 540
ATATCCCAGT ATCTCTTTGC TCAATCACAG CTGTGACCCC AACTGTTCGA TTGTGTTCAA 600
TGGGCCCCAC CTCTTACTGC GAGCAGTCCG AGACATCGAG GTGGGAGAGG AGCTCACCAT 660
CTGCTACCTG GATATGCTGA TGACCAGTGA GGAGCGCCGG AAGCAGCTGA GGGACCAGTA 720
CTGCTTTGAA TGTGACTGTT TCCGTTGCCA AACCCAGGAC AAGGATGCTG ATATGCTAAC 780
TGGTGATGAG CAAGTATGGA AGGAAGTTCA AGAATCCCTG AAAAAAATTG AAGAACTGAA 840
GGCACACTGG AAGTGGGAGC AGGTTCTGGC CATGTGCCAG GCGATCATAA GCAGCAATTC 900
TGAACGGCTT CCCGATATCA ACATCTACCA GCTGAAGGTG CTCGACTGCG CCATGGATGC 960
CTGCATCAAC CTCGGCCTGT TGGAGGAAGC CTTGTTCTAT GGTACTCGGA CCATGGAGCC 1020
ATACAGGATT TTTTTCCCAG GAAGCCATCC CGTCAGAGGG GTTCAAGTGA TGAAAGTTGG 1080
CAAACTGCAG CTACATCAAG GCATGTTTCC CCAAGCAATG AAGAATCTGA GACTGGCTTT 1140
TGATATTATG AGAGTGACAC ATGGCAGAGA ACACAGCCTG ATTGAAGATT TGATTCTACT 1200
TTTAGAAGAA TGCGACGCCA ACATCAGAGC ATCCTAAGGG AACGCAGTCA GAGGGAAATA 1260
CGGCGTGTGT CTTTGTTGAA TGCCTTATTG AGGTCACACA CTCTATGCTT TGTTAGCTGT 1320
GTGAACCTCT CTTATTGGAA ATTCTGTTCC GTGTTTGTGT AGGTAAATAA AGGCAGACAT 1380
GGTTTGCAAA CCACAAGAAT CATTAGTTGT AGAGAAGCAC GATTATAATA AATTCAAAAC 1440
ATTTGGTTGA GGATGCCAAA AAAAAAAAAA AAAAAAA

Seq ID NO: 421 Protein sequence
Protein Accession #: NP_073580

1 11 21 31 41 51
| | | | | |
MRCSQCRVAK YCSAKCQKKA WPDHKRECKC LKSCKPRYPP DSVRLLGRVV FKLMDGAPSE 60
SEKLYSFYDL ESNINKLTED KKEGLRQLVM TFQHFMREEI QDASQLPPAF DLFEAFAKVI 120
CNSFTICNAE MQEVGVGLYP SISLLNHSCD PNCSIVFNGP HLLLRAVRDI EVGEELTICY 180
LDMLMTSEER RKQLRDQYCF ECDCFRCQTQ DKDADMLTGD EQVWKEVQES LKKIEELKAH 240
WKWEQVLAMC QAIISSNSER LPDINIYQLK VLDCAMDACI NLGLLEEALF YGTRTMEPYR 300
IFFPGSHPVR GVQVMKVGKL QLHQGMFPQA MKNLRLAFDI MRVTHGREHS LIEDLILLLE 360
ECDANIRAS

Seq ID NO: 422 DNA sequence
Nucleic Acid Accession #: NM_003014.2
Coding sequence: 238..648

1 11 21 31 41 51
| | | | | |
GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 180
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 240
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300
GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAGCAGTAC 420
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600
AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 660
ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780
AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840
AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900
GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960
TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020
CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080
AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140
AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200
GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260
AACCCGATUU^ GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320
GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380
ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440
GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT GAGTAACCTG TACAGAAGGT 1500
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA TAATATGTGC TGTGCATACT 1560
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TATGTTTTAT ATTGTAAAAT 1620
AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG GAAAATATAA TACCTTTTGA 1680
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GTTTATGGTC TGTTTTTAAG 1740
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT TAGCATATGG TGCAGAAGGA 1800
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG AGAAACTTAA AAAATTATAA 1860
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT GTCTGGATTC AAACAAAAAT 1920
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT CCTCTTAAGC AGCACCAGAA 1980
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC TAGTTGGCTA ATGCTCAAGT 2040
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT CCCAGGACAT CCACCCTGAG 2100
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CAAATTTTGT TTTTCTTCAT 2160
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC AATATAAATG TACAGAGAGG 2220
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA TTGGGATACT TTAATCAGAA 2280
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG AATTGTGGAC AATTGGAGGC 2340
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT CAGTAAGCAT GTATTTTATA 2400
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CCTATCTAAT CCTACTCTCC 2460
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT TTCTCCAGGT GTTTGCTTAT 2520
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA GTATACATGT GTTTCATAAC 2580
GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT TGTCAAGAAA GCAGAAACCA 2640
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TACTCAACAA ACTGTTGTGC 2700
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TAAACATCTC ACCGGAATTC 2760
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA

Seq ID NO: 423 Protein sequence
Protein Accession #: NP_003005.1

1 11 21 31 41 51
| | | | | |
MFLSILVALC LWLHLALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAILAIEQ 60
YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP 120
ESLACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRLSPDRCKC 180
KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTVVDV KEIFKSSSPI PRTQVPLITN 240
SSCQCPHILP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD 300
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV

Seq ID NO: 424 DNA sequence
Nucleic Acid Accession #: BC010423
Coding sequence: 248..1780



CACAGCGTGG 
AGCTACGGCT 
CAAGTGCGAG 
TCTGCAGCCG 
TTCAACCATG 
GCTGCTACTG 
CGTGGTAACT 
CGGCGAGCAA 
ACTAGCGCTA 
GGAGCAGCCG 
GCAGGCGGAT 
GGCGCGGCTG 
ACTAGAAGAG 
CCCCAGCGTG 
CTCCCGCTCT 
GCAGCCACTG 
CATCCTCCAC 
GTGGCACATT 
CTCATACAAC 
CACTTTGGGC 
CAATGAGTTC 
CTCTGGGAAG 
ACTCTTGTTC 
GGCCCAGCAG 
CCGGAGGCTG 
GAGAGCCGAG
AGAGCCCGAG 
TGAACTGCTG 
CAAACAGGCC 
CAATGGCATC 
CTAGGCCTGG 
ACACCCCCAT 
AACCCTTCTG 
CACTGTGTGT 
TGACTGTCCG 
AAGTGAACTG 
GTTTGGCGTG 
CAGACCCCAG 
CAGACCCAGG 
TCTCCTACCA 
GAGGCTTGAA 
ACATATTTTC 
ACTTTTAATT 
TTTTATTTTT 



11 
I 

GAAGCAGCTC 
GGGTGTGTAG 
AGGCAAGAAC 
GCTCCCAGGG 
CCCCTGTCCC 
CTGGCATCAT 
GTGGTGCTGG 
GTGGGGCAAG 
CTGCACTCCA 
CCGCCCCCAC 
GAGGGCGAGT 
CGGCTCCGAG 
GGCCAGGGCC 
ACCTGGGACA 
GCTGCCGTCA 
ACTTGTGTGG 
GTGTCCTTCC 
GGCAGAGAAG 
TGGACACGGC 
TTTCCCCCAC 
TCCTCAAGGG 
CAGGTGGACC 
TGCCTTCTGG 
ATGACCCAGA 
CATTCCCATC 
GGCCACCCTG 
GGCCGCAGTT
TCTCCAGGCT 
ATGAACCATT 
TACATCAATG 
CTCCTTCTGT 
TTCTTGCGGA 
TTCATCGGGA 
GTGCATGTGT 
TGGAGGGGTG 
TGGTGTATGT 
T6TGTCATGT 
AGCAGTATTA 
TGTGCGGGCA 
CTTCGGAGCC 
CTGTTACAGA 
TGTAAATATA 

ATTTTTTTTT 



21 
1 

TGGGGGAGCT 
AACGGGGCCG 
TCTGCAGCTT 
AGATCTCGGT 
TGGGAGCCGA 
TTACAGGCCG 
GCCAGGACGC 
TGGCATGGGC 
AATACGGGCT 
GCAACCCCCT 
ACGAGTGCCG 
TGCTGGTGCC 
TGACCCTGGC 
CGGAGGTCAA 
CCTCAGAGTT 
TGTCCCATCC 
TTGCTGAGGC 
GAGCTATGCT 
TGGATGGGCC 
TGACCACTQA 
ATTCTCAGGT 
TAGTGTCAGC 
TGGTGGTGGT 
AATATGAGGA 
ACACGGACCC 
ATAGTCTCAA 
ACTCCACGCT 
CTGGGCGGGC 
TTGTTCAGGA 
GGCGGGGACA 
TGACATGGGA 
AGATGCTCCC 
GGGCTCCACC 
GCCTGTGTGA 
ACTGTGTCCG 
GCCACGGGAT 
GGCTGTGTGT 
ATGATGCAGA 
TAGCTGGAGC 
ATGGGGGCAA 
AGCCCTCTGC 
CATGCGCCGG 
TTTTTTCTTG 
AGAGTTTGAG 



31 
I 

CGGAGCTCCC 
GGGCTGGGGC 
CCTGCCTTCT 
GGAACTTCAG 
GATGTGGGGG 
GTGCCCCGCG 
AAAACTGCCC 
TCGGGTGGAC 
TCATGTGAGC 
GGACGGCTCA 
GGTCAGCACC 
TCCCCTGCCC 
AGCCTCCTGC 
AGGCACAACG 
CCACTTGGTG 
TGGCCTGCTC 
CTCl-GTGAGG 
CAAGTGCCTG 
TCTGCCCAGT 
GCACAGCGGC 
CACTGTGGAT 
CTCGGTGGTG 
GGTGCTCATG 
GGAGCT6ACC 
CAGGAGCCAG 
GGACAACAGT 
GACCACGGTG 
CGAGGAGGAG 
GAATGGGACC 
CCTGGTCTGA 
GATTTTAGCT 
CATCCCACTG 
AATTGAGTCT 
GTGTTGACTG 
TGGTGTGTAT 
TTGAGTGGTT 
GACCTCTGCC 
GGTTGGAGGA 
TGGAATCTGC 
GTGTGAAGCA 
CCTCTGGTGG 
GAGCTTCTTG 
CCCTTTCCAT 
TCCAGCCTGG 



41 
I 

GATCACGGCT 
TGGGTCCCCT 
GGGTCAGTTC 
AAACGCTGGG 
CCTGAGGCCT 
GGTGAGCTGG 
TGCTTCTACC 
GCGGGCGAAG 
CCGGCTTACG 
GTGCTCCTGC 
TTCCCCGCCG 
TCACTGAATC 
ACAGCTGAGG 
TCCAGCCGTT 
CCTAGCCGCA 
CAGGACCAAA 
GGCCTTGAAG 
AGTGAAGGGC 
GGGGTACGAG 
ATCTACGTCT 
GTTCTTGACC 
GTGGTGGGTG 
TCCCGATACC 
CTGACCAGGG 
CCGGAGGAGA 
AGCTGCTCTG 
AGGGAGATAG 
GAAGATCAGG 
CTACGGGCCA 
CCCAGGCCTG 
CATCTTGGGG 
ACTGCTTGAC 
CTCCCACCAT 
ACTGTGTGTG 
TATGCTGTCA 
GCGTGGGCAA 
TGAA/^GCA 
GAGAGGTGGA 
CTCCGGTGTG 
GCCAGTCCCT 
CCTCTGGGCC 
CAGGAATACT 
TAGTTGTATT 
ACGATATAGC 



51 
1 

TCTTGGGGGT 
AGTGGAGACC 
CTTATTCAAG 
CAGTCTGCCT 
GGCTGCTGCT 
AGACCTCAGA 
GAGGGGACTC 
GCGCCCAGGA 
AGGGCCGCGT 
GCAACGCAGT 
GCAGCTTCCA 
CTQGTCCAGC 
GCAGCCCAGC 
CCTTCAAGCA 
GCATGAATGG 
GGATCACCCA 
ACCAAAATCT 
AGCCCCCTCC 
TGGATGGGGA 
GCCATGTCAG 
CCCAGGAAGA 
TGATCGCCGC 
ATCGGCGCAA 
AGAACTCCAT 
GTGTAGGGCT 
TGATGAGTGA 
AAACACAGAC 
ATGAAGGCAT 
AGCCCACGGG 
CCTCCCTTCC 
GCCTCCTTAA 
CTTTACCTCC 
GCATGCAGGT 
TGTGGAGGGG 
TATCAGAGTC 
CACTGTCAGG 
GGTATTTTCT 
GACTGTGGCT 
AGGGAACCTG 
GGGTCAGCCA 
TGCTGCATGT 
GCTCCGAATC 
TTTTATTTAT 
CAGACCCTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 






WO 02/086443

CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA
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Seq ID NO: 425 Protein sequence 
Protein Accession #: AAH10423

1 11 21 31 41 51

| | | | | |

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120

DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180

VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240

HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300

GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360

FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480
AMNHFVQENG TLRAKPTGNG IYINGRGHLV

Seq ID NO: 426 DNA sequence 
Nucleic Acid Accession #: NM_003474.2 
Coding sequence: 307..3036

1 11 21 31 41 51 

| | | | | |

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 

TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120

CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180

CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 

GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 

GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 

GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420

GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 

TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540

CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 

TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 

TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720

TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780

AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 

TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 

CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 

GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020

CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080

ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 

CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 

TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260

GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320

GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380

AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440

ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500

GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 

AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620

GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 

AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 

GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 

AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 

GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920

CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980

TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040

AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100

GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160

CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220

GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 

GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 

GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGTAG CACAGACAGC 2400

GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460

TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520

TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580

CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GTAAAGGCCT GATGAGGAAG 2640

CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700

GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760

CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820

AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 

CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 

CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 

GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 

TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120

GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 

GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 

CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360

ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420

CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 

CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 

TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 

ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 

AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720

TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780

GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840



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10 



CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140
TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260
ACCTATTTCT TATACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380
CTCTTCACTC TTCATATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440
AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATA7ACTAAA CACGTGACAC 4740
AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040
ACCAAAAAAA AAAAAAAAAA AA

Seq ID NO: 427 Protein sequence
Protein Accession #: NP_003465

1 11 21 31 41 51

| | | | | |

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EVVSASVRSG DLWIPVKSFD 60
SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120
YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240
LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720
LAAGFVVYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900
RSTHTAYIK

Seq ID NO: 428 DNA sequence

Nucleic Acid Accession #: NM_003714

Coding sequence: 135..1043

1 11 21 31 41 51

| | | | | |

GAGGAGGAGG GAAAAGGCGA GCAAT^AAGGA AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60
GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120
TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180
TGGCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240
ACAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300
GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCGAG AACAACTCTT 360
GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420
ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480
ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540
AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600
TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660
ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720
AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780
AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840
GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900
GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960
TCGGGGGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020
AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080
GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140
ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200
TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260
ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320
AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380
GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440
GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGCGG CTGGATTCCA GGACAGGAGT 1500
GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560
AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620
CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680
TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740
ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800
TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860
TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920
TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980
CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACA/^ 2040
TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100
CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160
GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220
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AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 
TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC



2280 
2340 



PCT/US02/12476 



Seq ID NO: 429 Protein sequence
Protein Accession #: NP_003705






1 11 21 31 41 51

| | | | | |

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120
ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300
RR






Seq ID NO: 430 DNA sequence

Nucleic Acid Accession #: NM_005940

Coding sequence: 23..1489



AAGCCCAGCA 
CGCCCTCCTG 
TCTGCCGCCG 
AGCCCTGCCC 
CAGCCTCAGG 
CCGACAGAAG 
GATCCTTCGG 
CCTAAAGGTA 
TGACATCATG 
TGGGGGCATC 
CGACTATGAT 
AGCCCATGAA 
GTCCGCCTTC 
TCAACACCTA 
CCAGGCTGGG 
CTGTGAGGCC 
GGGCTTTGTG 
TCGCCACTGG 
CATTTGGTTC 
CCCCGCACCC 
GGGTCCCGAG 
CAGCACCCGG 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT 
GTAGCACCAT 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



11 

1 

GCCCCGGGGC 
CCCCCGATGC 
GACGTCCACC 
AGTAGCCCGG 
CCTCCCCGCT 
AGGTTCGTGC 
TTCCCATGGC 
TGGAGCGATG 
ATCGACTTCG 
CTGGCCCATG 
GAGACCTGGA 
TTTGGCCACG 
TACACCTTTC 
TATGGCCAGC 
ATAGACACCA 
TCCTTTGACG 
TGGCGCCTCC 
CAGGGACTGC 
TTCCAAGGTG 
CTCACCGAGC 
AAGAACAA6A 
CGTGTAGACA 
GACGCTGCCT 
AAGTTTGACC 
TTCTTTGGCT 
GGGTGCTGAC 
CTGTGGGCAC 
AACCACCATG 
TCAGACTGGG 
ATGCAGGTCC 
GGCAGGACTG 
GGCTGGCACT 
GGCTGTAGGG 
CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTC 



21 
1 

GGATGGCTCC 
TGCTGCTGCT 
ACCTCCATGC 
CACCTGCCCC 
GTGGCGTGCC 
TTTCTGGCGG 
AGTTGGTGCA 
TGACGCCACT 
CCAGGTACTG 
CCTTCTTCCC 
CTATCGGGGA 
TGCTGGGGCT 
GCTACCCACT 
CCTGGCCCAC 
ATGAGATTGC 
CGGTCTCCAC 
GTGGGGGCCA 
CCAGCCCTGT 
CTCAGTACTG 
TGGGCCTGGT 
TCTACTTCTT 
GTCCCGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
GAAGCAAGGG 
CAGGGCCACT 
ACAATCCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



31 

1 

GGCCGCCTGG 
GCTCCAGCCG 
CGAGAGGAGG 
TGCCACGCAG 
CGACCCATCT 
GCGCTGGGAG 
GGAGCAGGTG 
CACCTTTACT 
GCATGGGGAC 
CAAGACTCAC 
TGACCAGGGC 
GCAGCACACA 
GAGTCTCAGC 
TGTCACCTCC 
ACCGCTGGAG 
CATCCX5AGGC 
GCTGCAGCCC 
GGACGCTGCC 
GGTGTACGAC 
GAGGTTCCCG 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT 
GAAGGCTCTG 
TGCCAACACT 
CCACGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCAT6AC 
GGCTGCCCTG 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



41 

I 

CTCCGCAGCG 
CCGCCGCTGC 
GGGCCACAGC 
GAAGCCCCCC 
GATGGGCTGA 
AAGACGGACC 
CGGCAGACGA 
GAGGTGCACG 
GACCTGCCGT 
CGAGAAGGGG 
ACAGACCTGC 
ACAGCAGCCA 
CCAGATGACT 
AGGACCCCAG 
CCAGACGCCC 
GAGCTCTTTT 
GGCTACCCAG 
TTCGAGGATG 
GGTGAAAAGC 
GTCCATGCTG 
GACTACTGGC 
ACTGACTGGA 
GCCTACTTCC 
GAAGGCTTCC 
TTCCTCTGAC 
CAGGCTAGAG 
TGTCTCCTGC 
GCAGGTCGTG 
TTAAGAGG/ikA 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC 
ACCCCAGGCC 
GCCCTTTTCG 
AACCTTCTTC 



51 
i 

CGGCCGCGCG 
TGGCCCGGGC 
CCTGGCATGC 
GGCCTGCCAG 
GTGCCCGCAA 
tCACCTACAG 
TGGCAGAGGC 
AGGGCCGTGC 
TTGATGGGCC 
ATGTCCACTT 
TGCAGGTGGC 
AGGCCCTGAT 
GCAGGGGCGT 
CCCTGGGCCC 
CGCCAGATGC 
TCTTCAAAGC 
CATTGGCCTC 
CCCAGGGCCA 
CAGTCCTGGG 
CCTTGGTCTG 
GTTTCCACCC 
GAGGGGTGCC 
TGCGCGGCCG 
CCCGTCTCGT 
CATGGCTTGG 
ACCCATGGCC 
AGGGGGATGG 
GTCACCTGCC 
GGGCAGTCTT 
GTCCCTCAGG 
GTTGTGAGGT 
CAGCCCTGGC 
AGGTGCCTGC 
AGGCCAAAAA 
CTGGAGGCTG 
CAGCACTGCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



Seq ID NO: 431 Protein sequence
Protein Accession #: NP_005931






1 11 21 31 41 51

| | | | | |

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60
PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120
LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180
FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240
YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP PDACEASFDA 300
VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 360
QYWVYDGEKP VLGPAPLTEL GLVRPPVHAA LVWGPEKNKI YFPRGRDYWR PHPSTRRVDS 420
PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDPFGC 480
AEPANTFL






Seq ID NO: 432 DNA sequence 

Nucleic Acid Accession #: NM_024022

Coding sequence: 202..1563



1 11 21 31 41 51

| | | | | |

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60
GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120
CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180
AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240
TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300
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GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 
ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420
TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480
GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540
AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600
AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660
GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720
CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780
TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840
AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900
CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGG TCTGTTGCCC AGGCTGGAGT 1800

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CGCCTGACCT 1980

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460
AAAAA 

Seq ID NO: 433 Protein sequence 
Protein Accession #: NP_076927 

1 11 21 31 41 51 

| | | | | |

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT

Seq ID NO: 434 DNA sequence 

Nucleic Acid Accession #: NM_000493.2

Coding sequence: 97..2139

1 11 21 31 41 51

| | | | | |

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 
CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 
CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180
ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240
AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300
GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 360
GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420
GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480
GTTGGACCAG CTGGCCTACC AGGACCCCCA GGCCCACCAG GACCACCTGG AATCCCTGGA 540
CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600
CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660
GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720
GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780
GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840
CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900
GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960

ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020

GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

GGTCTTCCTG GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 

GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560

ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740 



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ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGG/WVCTCCC 1800 

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCTATGTGAG TACACCCCAC AGAGCTAATC 2160

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280 

AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AT^TATCAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 

AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2760 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880 

GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3060

ATATTATTTA TTTATGGACT GTACTGTATT TTTATATTGC TGTTTAATAC TTTTAAGCTG 3120

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATATU^T 3180 

AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC 7UVTAG 

Seq ID NO: 435 Protein sequence 
Protein Accession #: NP_000484.2 

1 11 21 31 41 51 

| | | | | |

MLPQIPFLLL VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 

GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 

VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 

GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420

PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 

YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 436 DNA sequence

Nucleic Acid Accession #: XM_062811 

Coding sequence: 1..888 

1 11 21 31 41 51 

| | | | | |

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 

CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGOGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 

GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420

AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 480 

ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 

CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 

GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 

Seq ID NO: 437 Protein sequence 
Protein Accession #: XP_062811 

1 11 21 31 41 51 

| | | | | |

MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG 60

GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120

VGSVFVAFII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180

SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240

QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTNSEQKMY PAVTV

Seq ID NO: 438 DNA sequence 

Nucleic Acid Accession #: NM_004004.1 

Coding sequence: 1..681 

1 11 21 31 41 51 

| | | | | |

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180 
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AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 
CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCGGAGACAT 
GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATCGAGGAG 
ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC
TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 
TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 
TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 
ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG 
AAGTCAAAAA AGCCAGTTTA A 






Seq ID NO: 439 Protein sequence
Protein Accession #: NP_003995.1



11 
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41 



51 



MDWGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILVVAA KEVWGDEQAD FVCNTLQPGC
KNVCYDHYFP ISHIRLWALQ LIFVSSPALL VAMHVAYRRH EKKRKFIKGE IKSEFKDIEE
IKTQKVRIEG SLWWTYTSSI FFRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC
FVSRPTEKTV FTVFMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV



Seq ID NO: 440 DNA sequence

Nucleic Acid Accession #: XM_061091.1

Coding sequence: 1..2481



1          11         21         31         41         51
|          |          |          |          |          |
ATGCCAAATA CTTCAGGAAC AACCAGGATT GAAATTTGGC TTCTCCAAGA GCCGCCCGGG 60
CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120
CCCGGGTACC CGCCAGTGCC GGCTGCCGAT GACCGATTCA GGCTGCCGAT GATTGGAGGT 180
CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 240
GTTGGGAAGC CTCCTTTTGA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGGAAG 300
ATTTCAGCTG CCAGCAAAAT GATGTGGTGC TCGGCTGCAG TGGACATCAT GTTTCTGTTA 360
GATGGGTCTA ACAGCGTCGG GAAAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCACA 420
GTCTGTGACG GTCTGGACAT CAGCCCCGAG AGGGTCAGAG TGGGAGCATT CCAGTTCAGT 480
TCCACTCCTC ATCTGGAATT CCCCTTGGAT TCATTTTCAA CCCAACAGGA AGTGAAGGCA 540
AGAATCAAGA GGATGGTTTT CAAAGGAGGG CGCACGGAGA CGGAACTTGC TCTGAAATAC 600
CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660
GTCACTGATG GGAAGTCCCA GGGGGATGTG GCACTGCCAT CCAAGCAGCT GAAGGAAAGG 720
GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCATGCACTG 780
GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGGA TGCCACCAAC 840
GGCCTCTTCA GCACCCTCAG CAGCTCGGCC ATCTGCTCCA GCGCCACGCC AGCTGGGAGC 900
CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTGACTCG 960
CAGCCCTGCC AGAATGGAGG CACATGTGTT CCAGAAGGAC TGGACGGCTA CCAGTGCCTC 1020
TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1080
GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGCGG 1140
GCCAAAGTCT TCGTGAAGCG GTTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCCGA 1200
GTGGGTGTGG CCACATACAG CAGGGAGCTG CTGGTGGCGG TGCCTGTGGG GGAGTACCAG 1260
GATGTGCCTG ACCTGGTCTG GAGCCTCGAT GGCATTCCCT TCCGTGGTGG CCCCACCCTG 1320
ACGGGCAGTG CCTTGCGGCA GGCGGCAGAG CGTGGCTTCG GGAGCGCCAC CAGGACAGGC 1380
CAGGACCGGC CACGTAGAGT GGTGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440
GCGGGCCCAG CGCGTCACGC AAGGGCGCGA GAGCTGCTCC TGCTGGGTGT AGGCAGTGAG 1500
GCCGTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGTGAT GGTCTACTCG 1560
GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAG 1620
CGGCCAGGGT GCCGGACACA AGCCCTGGAC CTCGTCTTCA TGTTGGACAC CTCTGCCTCA 1680
GTAGGGCCCG AGAATTTTGC TCAGATGCAG AGCTTTGTGA GAAGCTGTGC CCTCCAGTTT 1740
GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800
GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCGATGC TGCGGGCCAT TAGCCAGGCC 1860
CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCACATCTA TGACAAAGTG 1920
ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGGTGGT GCTCACAGGC 1980
GGGAGAGGCG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040
NNNNNNNNNN TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGCGGAGGCT TGCAGGTCCC 2100
CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCTGC GGTACCACCA GGACGTGCTC 2160
ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGCAAACC CAGCCCGTGC 2220
ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280
TGGGAGGGCC CCCACTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCAG 2340
GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400
ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460
GTCTGTGCCC CAGGTCCTTA G



Seq ID NO: 441 Protein sequence
Protein Accession #: XP_061091.1 



1          11         21         31         41         51
|          |          |          |          |          |
MPNTSGTTRI EIWLLQEPPG HRALVAALLP VSPSPELALA PGYPPVPAAD DRFTLPMIGG 60
QMHGEKVDLW SLGVLCYEFL VGKPPFEANE VHVSKETIGK ISAASKMMWC SAAVDIMFLL 120
DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHLEFPLD SFSTQQEVKA 180
RIKRMVFKGG RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 240
GVTVFAVGVR FPRWEELHAL ASEPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 300
PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR 360
VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 420
DVPDLVWSLD GIPFRGGPTL TGSALRQAAE RGFGSATRTG QDRPRRVVVL LTESHSEDEV 480
AGPARHARAR ELLLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 540
RPGCRTQALD LVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVNPDVTQVG LVVYGSQVQT 600
AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAVVVLTG 660
GRGAEDAAVP AQKLRNNGIS VLVVGVGPVL SEGLRRLAGP RDSLIHVAAY ADLRYHQDVL 720
IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 780
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP



Seq ID NO: 442 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..2424 

1          11         21         31         41         51
|          |          |          |          |          |
ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60
TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120
AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180
AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240
CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300
CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360
ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420
GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480
AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540
TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600
AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660
ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720
CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780
AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840
AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900
TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960
CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020
AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080
CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140
CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200
CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260
CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320
GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380
GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440
GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500
TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560
CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620
TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680
TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740
ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800
GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860
GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920
GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980
TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040
CCCCGGGATT CGCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100
CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160
TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220
GGCTGGGAGG GCGCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280
CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340
CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 2400
AATGTCTGTG CCCCAGGTCC TTAG

Seq ID NO: 443 Protein sequence 
Protein Accession #: Eos sequence 

1          11         21         31         41         51
|          |          |          |          |          |
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780
RTPPSNYREG LGTEMVPTFW NVCAPGP

Seq ID NO: 444 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 89..2356

1          11         21         31         41         51
|          |          |          |          |          |
GCCCCCTGGC CCGAGCGGCG CCCGGGTCTG TGAGTAGAGC CGCCCGGGCA CCGAGCGCTG 60
GTCGCCGCTC TCCTTCCGTT ATATCAACAT GCCCCCTTTC CTGTTGCTGG AAGCCGTCTG 120
TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 180
AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCGG CTGCAGTGGA 240
CATCATGTTT CTGTTAGATG GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 300
GCACTTTGCC ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG TCAGAGTGGG 360
AGCATTCCAG TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCAT TTTCAACCCA 420
ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 480
ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG CTTCTGTGCC 540
CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGG GATGTGGCAC TGCCATCCAA 600
GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTTTC CCAGGTGGGA 660
GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720
GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGCGC 780
CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC AGGACGCTGG AGATGGTCCG 840
GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG CGGACCCTTG CGGTGCTGGC 900
TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA ACCCACCCTG CCACCTGCTA 960
CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020
TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG GCCTTTGGAG GGGAGGCTAA 1080
CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140
TGCGGGCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC GGTTTGTGCG 1200
GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260
GCTGGTGGCG GTGCCTGTGG GGGAGTACCA GGATGTGCCT GACCTGGTCT GGAGCCTCGA 1320
TGGCATTCCC TTCCGTGGTG GCCCCACCCT GACGGGCAGT GCCTTGCGGC AGGCGGCAGA 1380
GCGTGGCTTC GGGAGCGCCA CCAGGACAGG CCAGGACCGG CCACGTAGAG TGGTGGTTTT 1440
GCTCACTGAG TCACACTCCG AGGATGAGGT TGCGGGCCCA GCGCGTCACG CAAGGGCGCG 1500
AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560
AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620
TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCCTGGA 1680
CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC GAGAATTTTG CTCAGATGCA 1740
GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800
CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA AACCCACCCG 1860
GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 1920
CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC CAGAGGGGTG CCCGGCCTGG 1980
TGTCCCCAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG CAGCCGTTCC 2040
TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGCGTGG GGCCTGTCCT 2100
AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG TGGCAGCTTA 2160
CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCCAAGCA 2220
GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280
TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG AGAACCGATT 2340
CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 2400
AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460
CCAGGTCCTT AGAATGfCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC ACTGAGGGAG 2520
GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580
AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640
GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG GGGTCCTGAA 2700
GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTCGCC AGAATGTTGT 2760
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGACGG

Seq ID NO: 445 Protein sequence
Protein Accession #: Eos sequence

1          11         21         31         41         51
|          |          |          |          |          |
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP



Seq ID NO: 446 DNA sequence 

Nucleic Acid Accession #: NM_031942.1 

Coding sequence: 145..1260

1          11         21         31         41         51
|          |          |          |          |          |
CCCGAGCCCC GCCCCTCCGG GCCCGGGTCG GCGCGCCCAG CCTGCCAGCC GCGCTGCTGC 60
TGCTCCTCCT GCTGTGGGAC CGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 120
CCGATCTGGG CACCCGCCAC CAGCATGGAC GCTCGCCGCG TGCCGCAGAA AGATCTCAGA 180
GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 240
TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 300
TCAGTTCGGG AAGGCTGTAG GACCCGCAGC CAGTGCAGGC ACTCTGGACG TCTCAGGGTG 360
GCGATGAAGT TTCCAGCGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 420
CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 480
AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 540
ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 600
GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 660
CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 720
GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 780
ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840
GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900
GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 960
TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 1020
CGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG GTGAAGAGGT CAGGGATGCT 1080
CTGCTGGATC CGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 1140
CGGCAGCGAG ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 1200
TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260
TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320
TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380
AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440
TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500
TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560
TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620
GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680
CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740
TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800
CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860
CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920
AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980
ATGAGTAAGC TGATTTGAAT TTTCAGTATA TWVACTTTAGT ATAATTGTAG TTTGCAAAGT 2040
TTATTTCAGT TCACATGTAA GGTATTGC/U^ ATAAATTCTT GGACAATTTT GTATGGAAAC 2100
TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160
CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220
GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280
CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340
AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400
GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA



Seq ID NO: 447 Protein sequence 
Protein Accession #: NP_114148.1 

1          11         21         31         41         51
|          |          |          |          |          |
MDARRVPQKD LRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDNFANTR LQSVREGCRT 60
RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120
NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180
TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240
PVEEITEEEL ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 300
LRNRYGEEVR DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 360
KSLKQEFEMQ A



Seq ID NO: 448 DNA sequence 

Nucleic Acid Accession #: NM_019894 

Coding sequence: 1..1314 

1          11         21         31         41         51
|          |          |          |          |          |

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGTCAA ACCCCTGCGC 60 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 120 

CTGAGCCTGG CGAGTATCAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCAGAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGGACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAG CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

Seq ID NO: 449 Protein sequence 
Protein Accession #: NP_063947.1 

1          11         21         31         41         51
|          |          |          |          |          |
MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIVV VLIKVILDKY 60
YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120
GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDVVEITEN SQELRMRNSS 180
GPCLSGSLVS LHCLACGKSL KTPRVVGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240
LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300
TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360
AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420



Seq ID NO: 450 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52..3042

1          11         21         31         41         51
|          |          |          |          |          |
GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60
GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120
GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180
CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240
AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300
TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360
GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420
GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480
TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540
AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600
CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660
GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720
TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780
TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840
CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900
AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960
GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020
GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080
GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140
GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200
TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260
GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320
AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380
GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440
TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500
ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560
ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620
AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680
CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740
GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800
GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860
CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920
GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980
TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040
TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100
CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160
CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220
ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280
GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340
TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400
GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460
CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520
GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580
CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640
TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700
GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760
CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820
GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGGTGGAA 2880
AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940
GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000
CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060
GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120
GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180
GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240
CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300
TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360
GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420
GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480
GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540
GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600
GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660
AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720
AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780
GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840
GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900
CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960
TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020
CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080
ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140
ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200
TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260
AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320
TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380
AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440
TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500
GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560
CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620
CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680
TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740
GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800
GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860
ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920
ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980
CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040
ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100
CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160
CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220
AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280
CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340
AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTGAGA 5400
GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460
AGCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520
TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580
ACGCTGCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640
GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 5700
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAATW^AAA AAAAAAAA






Seq ID NO: 451 Protein sequence 
Protein Accession #: XP_051860.2



1          11         21         31         41         51
|          |          |          |          |          |
MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60
LEDNVQSWKP GDTLVIASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120
DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNFFDFDTFG GHIKFALGFK AAHLEGTELK 180
HMGQQLVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDVVG 240
YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300
DCNAVSTFWM ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 360
NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHFI 420
AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN SLFVGESGNV 480
GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHTSA 540
LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600
SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660
HPLYLEGALT RSTHYQQYQP VVTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720
RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780
NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAVVDVPM PKKLFGSQLK 840
TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVVVIDGNQ GRVVSHTSFR 900
NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960
GFKGSFRPIW VTLDTEDHKA KIFQVVPIPV VKKKKL




Seq ID NO: 452 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 261.. 2861 



1 
1 

GAGCTAGCX3C 
CGGCGCGGGG 
AGCTACCACT 
ACGTCCGGGG 
AGAGGGAGCA 
TGCTGACCAT 
CTGGGTGCCC 
ACCATGTGCA 
CCATCCACAT 
TGCGAACCCG 
GCCCTTTCCA 
CGGATCCTTA 
ATGGACAGAA 
CAGAA6GAGG 
TCGACCCCAA 
AAGAGAGTGA 
TTGCAGTGAA 
AATTGGGAAG 
TGAAAGGAAA 
CTGCTGCTGC 
CTTTGTCCAG 
TATCTCAGAC 
TATGCAATCG 
AGGTTGTCTA 
GCCGGAGCTA 
CCATTGACAC 
AACCTGGAGA 
TCCAGGTGCT 
TGTACCTGCA 
TGAGCCGGAA 
ACATCTGCAA 
TTAAGGCAGC 
AGTACCCGAT 
CCACATACAT 
GCTCCAATGG 
TCACGGAAGA 
AGTCTGGAAC 
ACTCCTACCC 
GGATGGCCAA 
GATTTTGGTT 
GTTATTCAGA 
GGGCTGGCAT 
GGCCGTTCCT 
AGCCCCGGGA 
CCTGGCTGCG 
AAGGCTTCTT 
CCTCTGGAAT 
CGGGGTCGCC 
GGGAGCAATG 
CTCTGACTCC 
TCTTCATCCA 
CCTCGGCCTG 



11 
I 

TCAAGCAGAG 
AGCCAGCGGG 

CCGCTTGCCC 
CCGCTGCGCT 
CACTGCCAGG 
CAGCTGGCTC 
TGACCAGAGC 
TATCGGCCAG 
CTCAGAGGGA 
GCACATCCTG 
GGGCAATTTC 
CTATGGTCTG 
AAAGCTCTCC 
CTATTTTTTT 
ATCAGGCACA 
ACGTCTGGTC 
TGATGAAGGT 
CAAACACTTC 
TCCATCATCT 
CCGGGTATTC 
TGAGTGGGTT 
TAAAGGTGGG 
TCCCATTGAT 
CAAAAAAGGC 
CCGTGTACGG 
CAATGTGT^C 
TACCCTGGTC 
TCCCTGCAGA 
CATCGGGGAG 
CATCATAGTG 
TTTCTTTGAC 
ACACTTGGAG 
TCACTTCCAC 
CAGGGACCTC 
CTTGTTGATC 
TGGGCCGGAG 
CCTCCTCCCC 
AGGGTACATC 
TCCCAACAAC 
TATTTTTCAC 
GCACATTCCA 
GATCATAGAC 
CTCAATCATC 
GCCGGCCATC 
CGGCGGGGAT 
GCTTACAGGA 
GGCTCAGGGA 
CTTTGCTCAC 
GGCTTTGCTG 
AAGAGGGTGA 
GGGAACTGAG 
GGATTTCAGA 



21 
1 

CCCAGCX3CGG 
GCTGAGCGCG 
ACGCCCCGGG 
CCTGGCCCGC 
ATGGGAGCTG 
ACTCTGACCT 
CCTGAGTTGC 
GGCAAGACAC 
GGCAAGCTGG 
ATTGACAACG 
ACCATCATTT 
AAGTACATTG 
TGGACATTTC 
GAAAGGAGCT 
GTCATCCATT 
CAGTATTTGA 
TCTCGAAATC 
CTGCACCTTG 
TCAGTGGAAG 
AAATTGTTCC 
CAAGACGTGG 
GAGAAAATTT 
ATACAGGCCA 
CAGGATTATA 
TTCCTCTGTG 
AGCACCATTC 
ATTGCCAGTA 
TCCTGCGCCC 
GAGATAGACG 
ATGGGGGAGA 
TTCGATACCT 
GGCACGGAGC 
CTGGCCGGTG 
TCCATCCATC 
AAGGACGTTG 
GAACGCAACA 
TCX3GACCGTG 
CCCAAGCCCA 
AACCTCATCA 
CACGTACCAA 
CTGGGAAAAT 
AACGGAGTCA 
TCTGCCAGAT 
ATCAGACACT 
GTGTGGCTGG 
ATGAAGGCTG 
TTCAGCCCTC 
GTCTCTCTGG 
CTTATGAGCA 
AGTCCACAGA 
CACAGGGGGG 
GCTGGAAATA 



31 
1 

TGCTATCGGA 
GCCAGGGTCT 
AGCTCGCGGC 
GAGGCGTGAC 
CTGGGAGGCA 
GCTTCCCTGG 
TiACCCTGGAA 
TGCTGCTCAC 
TCATTAAAGA 
GAGGAGAGCT 
TGTATGGAAG 
GGGTTGGTAA 
TGAACAAGAC 
GGGGCCACCG 
CTGACCGGTT 
ACGCGGTGCC 
TGGATGACAT 
GATTTAGACA 
ACCATATTGA 
AGACAGAGCA 
AGTGGACGGA 
CAGACCTCTG 
CTACAATGGA 
GGTTTGCTTG 
GGAAGCCTGT 
TGAACTTGGA 
CTGATTACTC 
CCAACCAGGT 
GCGTGGACAT 
TGGAGGACAA 
TTGGGGGCCA 
TGAAGCATAT 
ATGTAGACGA 
ATACATTCTC 
TGGGCTATAA 
CTTTTGACCA 
ACAGCAAGAT 
GGCflAGACTG 
ACTGTGCCGC 
CGGGCCCCTC 
TCTATAACAA 
AAACCACCGA 
ACAGCCCTCA 
TCATTGCCTA 
ACAGCTGCCA 
GGGGCATTTT 
CCTGCCGCTG 
CCCACTCATG 
CAGAGGAATT 
AGTGAGCTCC 
CTCCAGGAGA 
TAGAAAATAT 



41 

1 

CAGAGCCTGG 
GAACCCAGAT 
GCCTGGCGGT 
ACTGTCTCGG 
GGACTTCCTC 
GGCCACATCC 
CCCTGGCCAT 
CTCTTCTGCC 
CCACGACGAG 
GCATGCTGGG 
GGCTGATGAA 
AGGAGGCGCT 
CCTTCACCCA 
TGGAGTTATT 
TGACACCTAT 
CGATGGCAGG 
GGCCAGGAAG 
CCCTTGGAGT 
ATATCATGGA 
TGGCGAATAT 
GTGGTTCGAT 
GAAAGCTCAC 
TGGAGTTAAC 
CTACGACCGG 
GAGGCCCAAA 
GGATAATGTA 
CATGTACCAG 
CAAAGTGGCA 
GCGGGCGGAG 
ATGCTACCCC 
CATCAAGTTT 
GGGACAGCAG 
AAGGGGAGGT 
TCGCTGCGTC 
CTCTTTGGGC 
CTGTCTTGGC 
GTGCAA6ATG 
CAATGCTGTG 
TGCAGGATCT 
CGTGGGAATG 
CCGAGCACAT 
GGCCTCTGCC 
CCAGGACGCC 
CAAGAACCAG 
TTTCAGAGGG 
GCTGGGGGGA 
CCTGCTGAAG 
ATGGAGAAGT 
CAGTCCCCAG 
TGCCTTAGGG 
CCCTAGATGT 
CTAGCCCAAA 



51 
1 

CGAGCGCAAG 
TTCCCAGACT 
CAGCGACCAG 
CTACAGACCC 
TTCAAGGCCA 
ACAGTGGCTG 
GACCAAGACC 
ACGGTCTATT 
CCGATTGTTT 
AGTGCCCTCT 
GGTATTCAGC 
CTTGAGTTGC 
GGTG6CATGG 
GTTCATGTCA 
AGATCCAAGA 
ATCCTTTCTG 
GCGATGACCA 
TTTCTAACTG 
CATCGAGGCT 
TTCAATGTTT 
CATGATAAAG 
CCAGGAAAAA 
CTCAGCACCG 
GGCAGAGCCT 
CTCACAGTCA 
CAGTCATGGA 
GCAGAAGAGT 
GGGAAACCAA 
GTTGGGCTTC 
TACAGAAACC 
GCTCTGGGAT 
CTGGTGGGTC 
TATGACCCAC 
ACAGTCCATG 
CACTGCTTCT 
CTCCTTGTCA 
ATCACAGAGG 
TCCACCTTCT 
GAGGAAACTG 
TACTCCCCAG 
TCCAACTACC 
AAGGACAAGC 
GACCCGCTGA 
GACCACGGGG 
GAGGCTCAGG 
GATGAGGCAG 
CTGGTGACTA 
GTGGTCAGAG 
GCAGCCCTGC 
CCTCATTTGC 
GCTCGTACTC 
GCCTTCATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 







TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300 

AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420 

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3 840 

TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960

GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 

TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCT^GAGTT GTACATATGT TTCACAGTAC 4200 

AGGATCTGTA CATAATU^GTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380

TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCA7AAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA TVA 

Seq ID NO: 453 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 

KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420 

STILNIlED^FV QSWKPGDTLV lASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540 

GTELKIW4GQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 720

LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780 

IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 840
FSPPCRCLLK LVTTGSPFAH VSLAHS 

Seq ID NO: 454 DNA sequence 

Nucleic Acid Accession #: NM_013282.2

Coding sequence: 85..2466

1 11 21 31 41 51 

I I I I I I

CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGTAA AACTGATGGG GGTTTTTGCT 60 

GTCCCTCCCC TCAGCGCCGA CACCATGTGG ATCCAGGTTC GGACCATGGA CGGGAGGCAG 120 

ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 180 

CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 240 

GAGGACGGCC ATACCCTCTT CGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 300 

GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 360 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCAC CCACGGCGAG 420 

GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 480 

GGGCTGTACA AGGTCAATGA GTACGTCGAT GCTCGGGACA CGAACATGGG GGCGTGGTTT 540

GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGACGAGCC CTGCAGCTCC 600 

ACGTCCAGGC CGGCGCTGGA GGAGGACGTC ATTTACCACG TGAAATACGA CGACTACCCG 660 

GAGAACGGCG TGGTCCAGAT GAACTCCAGG GACGTCCGAG CGCGCGCCCG CACCATCATC 720

AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC CGACAACCCC 780

AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGCGAGAC CAGGACGGCG 840 

CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 900 

TTCGTGGACG AAGTCTTCAA GATTGAGCGG CCGGGTGAAG GGAGCCCCAT GGTTGACAAC 960 

CCCATGAGAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAACAGACTC 1020

TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG ACCCCGACAA GCAGCTCATG 1080

TGCGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAGCAGTGTT 1140 

CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 1200 

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260 

TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320 

GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380 

CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500 

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560 

GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740

AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 1800

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920
GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 2100

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940

TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 3180 

TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540

TTGTTTTTGT ATTTTTTTTC TTTTGAT^GG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA

Seq ID NO: 455 Protein sequence 
Protein Accession #: NP_037414.2 

1 11 21 31 41 51 

I I I I I I

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP GLQRLFYRGK QMEDGHTLFD 60 

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120

PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQVVRVT RKAPSRDEPC SSTSRPALEE 180

DVIYHVKYDD YPENGVVQMN SRDVRARART IIKWQDLEVG QVVMLNYNPD NPKERGFWYD 240

AEISRKRETR TARELYANVV LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 360

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420

IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNFFTYTG 480

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE GAEAKDWRSG KPVRVVRNVK 540

GGKNSKYAPA EGNRYDGIYK VVKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK 600

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT 

Seq ID NO: 456 DNA sequence 

Nucleic Acid Accession #: NM_001200.1 

Coding sequence: 325..1514

1 11 21 31 41 51 

I I I I I I 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120

TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 

GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 300

CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 360 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 420 

TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 

TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720

TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840

TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320

TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380

ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AA?ATTAAAT ACATAAATAT ATATATA 
MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL
FGLKQRPTPS RDAVVPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR
LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR
TANSKFPVTR LLDT

Seq ID NO: 458 DNA sequence
Nucleic Acid Accession #: NM_001999.2 
Coding sequence: 1..8736 

1 11 21 31 41 51

I I I I I I

ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300

TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600

TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780

CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 

TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT GCAACACCGT GGGAAGCTAT 1020 

TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080

AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 

AGAATGACGA TUU^TGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200

CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 

CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320

TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380

CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 

GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500

GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 

AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620

AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680

GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 

AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 

TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 

ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 

ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 

TGCCAGACCC CAGGTATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040

TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 

ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 

GCAGTGACCA AGTCCGAATG GTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220

CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280

ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 

GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 

GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 

GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 

GTGTTCAGGA GTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 

GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640

AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 

ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760

GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 

CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 

TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 

TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 

CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 

ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 

AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 

GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240

GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300

AGTGGCTTTG CTCTAGACAT GGAGGT^GA AACTGCACGG ACATCGACGA GTGCAGGATT 3360

TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 

TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 

TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 

CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 

AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA T^TGTGTGAA CATGATTGGA 3660

ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 

GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 

GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840

GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900

ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960

ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 

GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 

ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 

GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200

AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 

AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 

GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440

CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 

GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 

GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 

TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 

TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 

AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 

AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860

CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 

AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 

CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 

TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 

GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 

GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGT^AAAAG CTTTTGCTAC 5220 

CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 

TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340

CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400

CACACAGGAA T^GCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 

AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 

TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 

GGTCAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 

AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 

GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420

CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600

TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200

CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440

ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC GGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860

CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920

GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980

AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160

CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280

TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 

GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT -ttTTTTTTTT 9060 

CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAT^ATATA 9120 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 

CTTTTTAATU^ ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 

TCAATAATTT AAAAGACATG 7U\TGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660

TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720

ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780

GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9840

TGTAGTTATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900

ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960

TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020

CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080

CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140
TTAATAACTA AAAAAAAACT CGTGCC



Seq ID NO: 459 Protein sequence
Protein Accession #: NP_001990.1



1 11 21 31 41 51 

I I I I I I 

MGRRRRLCLQ LYFLWLGCVV LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60

PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120 

DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 

ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 240

TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN CINTVGSFEC 300

RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 360

RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE EYRRLCMDGL 420

PMGGIPGSAG SRPGGTGGNG FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480

GPIITGLTIL NQTIDICKHH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI 600 

CNAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSFKC ICKPGFVLAP NGRYCTDVDE 660 

CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720

AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPDICAN 780 

GICENLRGSY RCNCNSGYEP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGLICI DSLKGTCWLN 900

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020

MDACCCAVGA AWGTECEECP KPGTKEYETL CPRGAGFANR GDVLTGRPFY KDINECKAFP 1080

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140

CFEGYESGFM MMKNCMDIDG CERNPLLCRG GTCVNTEGSF QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGCT DIDECMIMNG GCDTQCTNSE 1260

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320

TCIDVNECDL NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380

ASCLNIPGSF KCSCREGWIG NGIKCIDLDE CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440

GDGFTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1500

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 1620

PPVNSTEYYT LCPGGEGFRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800

HTGKAVDIDE CKEIPGICAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQCICHN 1920

GFKASQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100

SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAQNP LLCALRCMNT FGSYECTCPI 2280 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2400

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460

NGQCINTMGS FRCFCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520

VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580

CGGKGICQNT PGSFSCECQR GFSLDATGLN CEDVDECDGN HRCQHGCQNI LGGYRCGCPQ 2640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSFDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISLESVDMDS PVNMKFNLSH LGSKEHILEL 2820

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE ITSIPLYKKK 2880
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y

Seq ID NO: 460 DNA sequence

Nucleic Acid Accession #: NM_013372.1

Coding sequence: 63..617

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA GTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCGG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900
CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200

TTTTGTGAAG ACCCTCCT^GA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCA7U\ CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTTAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCATVAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880

C7UVGTAAAGA GT^TTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTT^ ATAATGCCAA ACACCAAATA 3180

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCATW^GAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 461 Protein sequence 
Protein Accession #: NP_037504.1 

1 11 21 31 41 51 

| | | | | | 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 120 

QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 
IDLD 

Seq ID NO: 462 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2733 

1 11 21 31 41 51 

| | | | | | 

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60 

TTCCTGGGGA AAAATGATGG CATCAJ\AACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 

GAGAAAAGAG ATTTGAGAAA TTTTGTGAAG CTCTTG7\AGC CTCCATTATT ATGGTCACAT 240 

6GGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 360 

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420 

AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480 

AGGTTTACAA AT6ACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540 

ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600 

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 

TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780 

GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840 

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900 

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

6CAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020 

ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140 

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200 

AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAA?^GCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380 

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 

AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAATU^TGG AAATGCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 1620 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTT6AC CTCCTTCTCG ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980 

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GG6CTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG 6GATTTTTTA TCTTATGCTT TG6AATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTT^ CTCAGTTTGT CTCAAATGAA TAA 

Seq ID NO: 463 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | | 

MKVGVLWLIS FFTFTDGHGG FIjGKNDGIKT KKBLIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDIiRNFLK LLKPPLLWSH GIiIRIIRAKA TTDCNSIiNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAIi PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

lEIQIiKKAYE RIQGFESVQV TQFRNGSIVA GYEWGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLLBELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNIiASWS ILSNXSSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREBKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFIi FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPWKWITY VGLGISIGSL 600 

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LFFWMLMLGI LLAYRIIIiVF HHMAQHLMMA VGFCLGYGCP LIISVITXAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLIiAFWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRV6 KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKIiRQLLP NKLSALSSWK QTBKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900 

IMLTQFVSNE 

Seq ID NO: 464 DNA sequence 

Nucleic Acid Accession #: AB035089.1 

Coding sequence: 9845..10219 

1 11 21 31 41 51 

| | | | | | 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AT^GGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

aggaggaagg GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 

gagactattt CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

aagaaagg;u\ AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

gtagacagaa TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

aatctcctcc ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 

aataaaatgt TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

aagg;\agtga GGTCCAGGAA AATGTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

gtaatgacag GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

ggattcctgg AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

caagtgttca TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

tcaggatatg TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

taggagaact ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTG GTCCCCTGTA ACCCCGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGA6 1620 

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGT6 TTGGGGAGAA 1680 

TGTGGAGA/VA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 

TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAAT6TAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 

AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 

TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 

ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 

AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 

GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT G7\ATTTATGG 2280 

TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 

TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 

ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 

AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 

CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 

GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 

AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 

GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 

AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 

ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 

TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 

GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 

2\ACAG7\AAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 

CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 

ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 

TACAGAAAAA TTACTTIAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGA/yWVCAG 3240 

GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 

CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 

AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACfTATTCT 3420 

CATTGGCATT AGAT^GGCAC GTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 

TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 

TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 

TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 

GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 

GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 

AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 

TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 

CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 

GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 

AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 

GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 

TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 

CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 

TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 

TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 

CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 

CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 

CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 

TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 

AGCACAGGGG GCTGTGCAGG AATTCCCATA AGTGTGAGAC CACTGACTTA AACAGATCTT 4680 

TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTGACT TTGATCAAGT 4740 

CACAGAG7\AC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 

CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 

AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 

TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 

CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 

AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 

CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 

GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 

TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 

ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 

CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 

GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATA/U^CA GGGTTTTTTT CCCCATTCTC 5460 

TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 

TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580 

AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 

CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 

AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 

AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 

AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 

GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 

GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 

ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 

AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 

ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAG7WUVAG ACGTATCAAT TTTTACAGGT 6180 

AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 

AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 

GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 6360 

ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 

TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 

CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 

GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 

CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 

GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 

CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 

TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 

ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 

TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 

CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 7020 

JUVGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 

TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 

GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 

AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 

GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 

TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 

AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTC6TCTCC 7440 

TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 

CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 

ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 

TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 

GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 

CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 

AAGTAGG7W\G AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 

GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 

TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 

AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 

GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 

CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 

GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 

GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 

GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 

TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AATIATA/UVRA 8400 

TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATTUVCT TTATTTAGTT AGGAATACCT 8460 

GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 

TGTTGCTATC TCAATAATAT TATCTTTTTT GTGTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 

ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 

TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 

AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 

CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 

CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 

ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 

GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 

ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 

CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 

CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 

TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 

TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 

AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 

ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 

GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 

AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 

ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 

ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTAl^ATATTC AGGACTCATA 9660 

TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 

CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 

TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 

ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 

TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 

GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 

ATCTAAAGTC CTACAC7VAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 

TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 

TAATCACCCT TTCCTATTCT TCATT^GGCA AAATTJiGACC AACAGCATCC TCTTCTATGG 10200 

CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 

GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 

CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 

TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 

CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 

ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC ATU^TCATACT TTCTCTTCAA 10560 

ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 

GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 

TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 

AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTGATGATGA 10800 

GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTT^CCACA TAGGAACTTC 10860 

CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 

GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 

TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 

GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAT^ TAATAGTTAC 11100 

AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 

GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 

TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 

TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 

TATGGTAGTT GTAGCT7\AAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 

AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 

CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 

Seq ID NO: 465 Protein sequence 
Protein Accession #: BAB21525.1 

1 11 21 31 41 51 

| | | | | | 

MNSIiSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VIiLGAKDNTA QQISKVLHFD 60 
QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYOFLQEYIj 120 

DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPD6TIGN DTTLVLVNAI 180 
YFKGQWENBCF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLBDVQAK VLEIPYKGKD 240 
LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 300 
TM6MVNIFNG DADLS6MTWS HQIiSVSKVIiH KAFVBVTEEG VBAAAATAW WBLSSPSTN 360 
BBFCCNHPFL FFIRQNKTNS ILPYGRPSSP 

Seq ID NO: 466 DNA sequence 

Nucleic Acid Accession #: NM_001910.1 

Coding sequence: 50..1240 

1 11 21 31 41 51 

| | | | | | 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 

CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC ATTGGAGCCG ACCAAGTCTC T6TGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA [missing] GATGCAGAGT TTGATGGAAT 600 

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 720 

AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTG6CCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTATACCCt CAGCCCAACT GCCTACACCC TACTGGACTT 1080 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 

CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 

ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCATATTTTG TATTGATTTT 1500 

TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 

GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 

GTACCTGGAT CATTCTGAAG CAJ^ATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740 

TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTGG 1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 

AGTTAGATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040 

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 

CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 



Seq ID NO: 467 Protein sequence 
Protein Accession #: NP_001901.1 

1 11 21 31 41 51 

| | | | | | 

MKTLLIjLLLV LLBLGEAQGS LHRVPLRRHP SLKKKLRARS QLSEFWKSHN LDMIQFTESC 60 

SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS NLWVPSVYCT SPACKTHSRF 120 

QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 180 

FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH 240 

FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 300 

IGAAPVDGEY AVECANLNVM PDVTFTINGV PYTLSPTAYT LLDFVDGMQF CSSGFQGLDI 360 

HPPAGPLWIIi GDVFIRQFYS VFDRGNNRVG IiAPAVP 

Seq ID NO: 468 DNA sequence 

Nucleic Acid Accession #: NM_018058.1 

Coding sequence: 319..1575 

1 11 21 31 41 51 

| | | | | | 

TACGCGCTGC GGGACC6GCA GGGGAAC6CC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 120 

TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 180 

GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 

AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC 300 

CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 360 

CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 420 

GTGGGCCCCA TCCTCAGCA6 CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 480 

AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 540 

GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 600 

AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 660 

ACCCATGGGA AGGTCCGCTT CCGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

GTCCGCAC6G TCATCACCGC CGACTTTGAC AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

AACATTGCCT ACCGCAGCTC CTCAGCCAAC CGCCTCTTCC GCGTCATCCG TAGAGAGCAC 840 

GGAGACCCCC TCATCGAGGA GCTCAATCCC GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 

GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 

TGGCTGCGAG TGGTGCCACG CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGT6 1080 

CTCTACACCA AGAAGAGTGG GGCCCACCTG AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGG/kAGG ATGAAGCCAG CAGTGTGGAG 1200 

GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGAACGTGG CCAGCGGGGA GATGT^CTCA 1260 

GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 1320 

ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 

ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TAC6AGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 

CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 

CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTC6TA AGGCCAGGCC 1800 

CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 2040 

AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 

GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 
GGGCTTTGTC AACACGTG 

Seq ID NO: 469 Protein sequence 
Protein Accession #: NP_060528.1 

1 11 21 31 41 51 

| | | | | | 

MDPEASDLSR GIIiALRDVAA EAGVSKYTGG RGVSVGPIIiS SSASDIFCDN ENGPNFIjFHN 60 

RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120 

FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFPNNIAYRS SSANRLFRVI RREHGDPIjIE 180 

EXiNPGDAXjEP EGRGTGGWT DFDGDGMIiDL IXiSHGBSMAQ PLSVFRGNQG FNNNWLRWP 240 

RTRVGAFARG AKWLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG 300 

KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 360 

GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LFLLHRSS 



Seq ID NO: 470 DNA sequence 
Nucleic Acid Accession #: AJ279016 
Coding sequence: 1..1962 

1 11 21 31 41 51 

| | | | | | 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 

CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720 

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 

GTCaSCTTCC GGGACATCGG CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080 

CGCAGCTCCT CAQCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 

ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 

GTGACCGACT TCQACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 

GAT6GCAAGA TGGTGAGCCG GAACGTGGCC AGCG66GAGA TGAACTCAGT GCTGGAGATC 1560 

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 

GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 

AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG GTGCTGCTGC CACTGCCGCT 1860 

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCG6TCC TCGTAGATGG AGATCTCAAT 1920 

CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980 

CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040 

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220 

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGA6TTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 

AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 

AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 



Seq ID NO: 471 Protein sequence 
Protein Accession #: CAC08451 
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1 11 21 31 41 51 

! 1 I 1 I I 

OJ MSRMLPFLLL LWPLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQIiNYGV AVTDVDHDGD 60 

FEIWAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 120 
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 180 
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6RYSIYIANY AYGNVGPDAIi lEMDPEASDL SRGIIjALRDV 
LSSSASDIFC DNBNGPNFLF HNRCTOQTFVD AAASAGVDDP 
VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV 
RSSSANRIiFR VIRREHGDPL lEELNPGDAL EPEGRGTGGV 
AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKWLYTK 
PVAHPGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVI.EI 
FSQQENGHCM DTKECIQFPP VCPRDKPVCV NTYGSYRCST 
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVIiVDGDliN 

Seq ID NO: 472 DNA sequence 
Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..4794 



PCT/US02/12476 



AAEAGVSKYT 
HQH6RGVALA 
ITADFDNDQE 
VTDFDGDGML 
KSGAHLRIID 
LYPRDEDTLQ 
NKKCSRGYEP 
LGSWKESCE 



GGRGVSVGPI 
DFNRDGKVDI 
LEIPFNNIAY 
DLILSHGESM 
GGSGYLCEME 
DPAPLECGQG 
NEDGTACVGT 
PSC 



240 
300 
360 
420 
480 
540 
600 



ATGGCGTGTC 
AGCGGCTCCT 
GTTCTGAAGT 
TCACCCTACT 
ATCGACGGGG 
CACAGCAGCT 
CCACCTACAA 
TCCTCCCTGG 
TGTCGGGGTG 
GGGGTGGCCA 
CTGAGCGATG 
GCCTGTGTGG 
GGTAATGTGG 
GGCATTCTGG 
TTCTCCCACA 
GGAGGAGACC 
TGCCGGCTGG 
CAGAGGGAGG 
TCCAAAAGCC 
GCGCCTTCTC 
CCCCTTGTCA 
CCCCACCCCC 
CTGATGGCTG 
CTGAGAAGCT 

gagctgggaG 
ctgggagaac 
cccaaggtca 

GGCCCCGGGA 
CTCTCCCATC 
GT6CCX3GGAG 
CTGGCGTGGA 
TTTAGGCTCA 
CTGCAGTTCC 
TCTGCCACTC 
ATCCTCAGCA 
TTCCACAACC 
GCCTTCATCG 
CTAGCAGAAA 
CCACATTGCC 
TTCTT6ACGC 
CAGGGGGCCC 
ACTGCCTATT 
TTGTCCTCTG 
GCCCTGGCTG 
CCCCACCGCC 
TCACCCAAGT 
GACCAGGAGC 
CTCTTCCGAT 
GGTCAGGGAG 
AAGGTCAACA 
AGAGGCTGTG 
AAAGGGAAGG 
CCACACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTACGGGCT 
AGGGGCTACG 
AGAAAGGGGC 
CCAGGAAAA6 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTATGGGGT 
AGGGGCTATG 
AGA6AGCACG 
GGCCGGGGCA 
TTGTCCCATG 
AACAACAACT 
AAGGTCGTGC 
GGCTACCTGT 
AGTGTGGAGG 



11 
I 

CGGGAGGACT 
CCCCAGCATC 
ATGACCGGGC 
ACGCGCTGCG 
ACGGCCGGGA 
CAGCGCAGGT 
CCCCTGCAGG 
GTCAGGCTTC 
GACTGAGACC 
CGTACACCGA 
AGGTCAAOGT 
ACAGAAAGGG 
GCCCTGATGC 
CGCTCAGAGA 
CTGCCTCTCC 
CAGAGGAGGC 
GCTGGAAGGA 
CTGGGGCAGC 
ATTTGGCTGA 
CAGCCCACCC 
CTCAGCTAAT 
GAGCCCCAGG 
AGQCTTTGGG 
GGGAGGAAAG 
GTCCCTGGAG 
CTCCCATTTT 
CACAGGAGTG 
GGGTGGCCAA 
CCCTGGTCCC 
CTGCCCTGCC 
ACCAGATGGA 
GGAAAGCACG 
CCTCAGGCCT 
ACTGTGGGTC 
GCAGTGCCTC 
GGGGCGATGG 
TTCACCTCAA 
CTGGTCCTTC 
ATCATGGTTT 
AAGGCTTGGC 
CACCCTGCGT 
ACATTGTCCT 
AAAGAGTCAA 
ACTTCAACCG 
TCTATCTGCA 
TCTCCATGCC 
TQGAGATCTT 
GCTCCATCCT 
AAGGTTTAAG 
CAGGTCCCCT 
GGAATGCAGG 
GAAATGTGGC 
ACAAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGCTCCAATC 
TACGGGCTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCACAGAA 
TCACTACCAG 
CCAATCACTA 
GGGTCCAATC 
GAGACCCCCT 
CAGGGGGTGT 
GAGAGTCCAT 
GGCTGCGAGT 
TCTACACCAA 
GTGAGATGGA 
TGACGTGGCC 



21 

1 

CCCAGCCCGT 
CCCTCCCCAT 
CCAGAAGCGG 
GGACCGGCAG 
GGAGATCTAC 
CCCTTCTGGG 
CCTCCTGGGT 
TCCGGACAGC 
TACCCATGAA 
CAAGTTGTTC 
GGOCCGTGGT 
CTCTGGACGC 
CCTCATTGAA 
TGTGGCTGCT 
AAGCATTGGT 
AGATGAGGAG 
CGGGCAGTTC 
TGGCGTGCCC 
CAAGAACCTA 
TTTCCCTGCC 
GACACATGGA 
AATGGACCCC 
CGCGTGGCCA 
CAGGCAGAAG 
CCAAGCCACA 
ACAAAGAACA 
CCATCTAGTG 
GCGAGAGATT 
CAACTTCCCC 
TGGGAATCCT 
AAAAGAGGAG 
GGAAGCAGAA 
CAGAGGCAGC 
GATGTCTTTT 
GGATATCTTC 
CACCTTTGTG 
ATATCACCTC 
CTCCTCCTGC 
GTCTATGAGC 
CTCCAGTGCC 
TCTGGCAAGA 
GTGGTCTGCC 
CGTGGGTGTG 
TGATGGCAAA 
AATGAGCACC 
CTCCCCTGTC 
CTTCAACAAC 
GGCTCGTGGC 
AATCAGAAGG 
GATGAAGAAA 
GCAAAGCCTG 
CCAAAGTGTG 
GCTACAGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTACAGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCACAG 
CATCGAGGAG 
GGTGACCGAC 
GGCTCAGCCG 
GGTGCCACGC 
GAAGAGTGGG 
GCCCGTGGCA 
AGATGGCAAG 



31 
I 

TGCTCTGGTT 
TCCTCCTCCA 
CTGGTGAACA 
GGGAACGCCA 
TTCCTCAACA 
CTCCACAGAA 
CTGCCTCCAC 
AGGCAGGGAG 
CCAGAACCAT 
AAGTTCCGCA 
GTGGCCAGCC 
TACTCTATCT 
ATGGACCCTG 
GAGGCTGGGG 
GAGATATCTG 
CACAGTGGGG 
AAGGAAGAAG 
AGAGGACGTG 
TTTGGCCCAC 
CGCCAAGCCC 
CGTCTGGCTG 
AAATGTAAGG 
GCGCTCAGCA 
GGGCAGGCCA 
CAGCACCTGC 
GACGGAGATC 
GCCACCATGC 
GGGAGAGAGA 
AGCTGCTTGA 
GGGAACTGGG 
GGGAAGATTC 
TTCCCCCCAG 
CCTGTCCTCC 
CTAGGGGGCC 
TGCGACAATG 
GACGCTGCGG 
TGCAGAGATT 
TGCCCGTGGC 
TTTACAAGGA 
CACCGGAGGA 
GCTCCCTGTG 
ATCCCAGAGA 
GACGACCCCC 
GTGGACATCG 
CATGGGAAGG 
CGCACGGTCA 
ATTGCCTACC 
TCTTCATCCT 
GGAGGGTTCC 
CAGAAAGGAA 
GCCAAGGAGC 
CCCAGAACCC 
CCAATCACTA 
GGGTCCAATC 
TACGGGGTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGCTCCAATC 
TACGGGCTCC 
GGGCTACGGG 
AAAGGGGCTA 
CTCAATCCCG 
TTCGACGGAG 
CTGTCCGTCT 
ACCCGGTTTG 
GCCCACCTGA 
CACTTTGGCC 
ATGGTGAGCC 



41 
I 

GGATGGGACT 
GGTACAATGG 
TCGCGGTCGA 
TCGGGGTCAC 
CCAATAATGC 
ACAGGCCTGT 
TCAGCGGAAG 
AGAGGGTGCC 
TTCTTCTGAG 
ATAACCGGTG 
TCTTTGCCGG 
ACATTGCCAA 
AGGCCAGTGA 
TCAGCAAATA 
GCAGAACCGA 
ATGGAAGCAC 
CAGCAGCTTT 
TTCGAACAGC 
CATGTTACTA 
CCCAACACTA 
GAAAACTAGC 
GCCGCCATGC 
CCACTGTGGT 
TGTCCAGATG 
CTGCTAGAGA 
CAGGGAGGAG 
CAGCTCTCGG 
CTGGGGCAGT 
GGCCTCTTGA 
TTCTGGACAT 
ATGGAGACCA 
GCTCCTCTGA 
AGGTGGGCCT 
GAGGCGTCAG 
AGAATGGGCC 
CCAGTGCTGA 
TTCCTCACTC 
ATGCACGTCT 
CCGGGTCACG 
CACTCAGCCT 
TCCTGGGGTC 
GCCTGATGAC 
ACCAGCATGG 
TCTATGGCAA 
TCCGCTTCCG 
TCACCGCCGA 
GCAGCTCCTC 
TGACAGCTGG 
CAGGGCCAGG 
GGAAGGACGA 
CGGCCTCTGC 
AAGCGCCACA 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CAGGGTCCAA 
GCTACGGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
JUVTCACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCGACGCCTT 
ACGGGATGCT 
TCCGGGGCAA 
GGGCCTTTGC 
GGATCATC6A 
TGGGGAAG6A 
GGAACGT6GC 



51 
1 

GGGTGGGCCC 
ACCCAACCTG 
TGAGCGCAGC 
AGCCTGC6AC 
CTTCTCGGGC 
GCTGAAGCCT 
GGACTTTTCC 
GGTTCCCTGC 
ACCCAAATCA 
GGAAGACATC 
ACGCTCTGTG 
TTACGCCTAC 
CCTCTCCCGG 
TACAGAAGGC 
GGAGCGGGAA 
CAGCCAACTG 
GGTGGAGGAA 
TCTGCAGACT 
TTCTGTCTGC 
CCCTGTAGCC 
CCGGAGTGTC 
TGAGCCCGGC 
GCCAGGGGGC 
TGCACTCAGG 
GCTGTATGAC 
AAGGGACTCG 
GGGACTCGAG 
AGGAAGACCA 
AGCCGGGACA 
GGCCAAGGCC 
TGAGCCCAGA 
GGAGCCTCTG 
GGGGCTTGCT 
CGTGGGCCCC 
TAACTTCCTT 
ACGTCGTTTA 
CCTGTGCCAC 
TCTTCAGGCT 
GTTCTATTCA 
CCAGGGTTCT 
TCTGATCCCC 
CCACAGCTAT 
GCGAGGTGTC 
CTGGAATGGC 
GGACATCGCC 
CTTTGACAAT 
AGCCAACCGC 
TGGGAGGAAC 
GGGTCAGGCC 
GGACTGGGCA 
TATTGCAGGG 
AGATACAAAG 
GGGCTACGGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGGTCCAATC 
TACGGGGTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGAG 
TACCAGGAAA 
CGTCATCCGT 
GGAGCCTGAG 
GGACCTCATC 
TCAGGGCTTC 
CAGGGGAGCT 
CGGGGGCTCA 
TGAAGCCAGC 
CAGCGGGGA6 
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WO 02/086443 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 

Seq ID NO: 473 Protein sequence 
Protein Accession #: FGENESH predicted 






MACPGGLPAR 
SPYYALRDRQ 
PPTTPAGLLG 
GVATYTDKLF 
GNVGPDALIE 
GGDPEEADEE 
SKSHLADKNIi 
PHPRAPGMDP 
ELGGPWSQAT 
GPGRVAKREI 
LAWNQMEKEE 
SATHCGSMSF 
AFIVHLKYHL 
FLTQGLASSA 
LSSERVNVGV 
SPKFSMPSPV 
GQGE6LRIRR 
KGKGNVAQSV 
RGPITTRKRG 
RKGLRAPITT 
NHYQEKGLQG 
AMGSNKYQEK 
GRGTGGWTD 
KWLYTKKSG 
MNSVLBILYP 
GSYRCRTNKK 
PGCRLLLKRA 



11 

I 

CSGWMGLGGP 
GNAIGVTACD 
LPPLSGRDFS 
KFRNNRWEDI 
MDPEASDLSR 
HSGDGSTSQIi 
FGPPCYYSVC 
KCKGRHAEPG 
QHLPARELYD 
GRETGAVGRP 
GKIHGDHEPR 
LGGRGVSVGP 
CRDFPHSLCH 
HRRTjbSLQGS 
DDPHQHGRGV 
RTVITADFDN 
GGFPGPGGQA 
PRTQAPQDTK 
YGVQSLPGKG 
RKRGYGVQSL 
PITTRKRGYR 
6LRAPITTRK 
FDGDGMLDLI 
AHLRIIDGGS 
RDEDTLQDPA 
CSRGYEPNED 
QliQAAPSTLIi 



21 

I 

SGSSPASPPH 
IDGDGREEIY 
SSLGQASPDS 
LSDEVNVARG 
GILALRDVAA 
CRLGWKDGQF 
APSPAHPPPA 
LMAEALGAWP 
LGEPPILQRT 
LSHPLVPNFP 
FRLRKAREAE 
ILSSSASDIF 
IiAETGPSSSC 
QGAPPCLLAR 
ALADFNRDGK 
DQELEIFFNN 
KVNTGPLMKK 
PHYHKKGLQG 
ATGSNHYQEK 
PGKGATGSNH 
VQSLPQKGAT 
RGYGVQSLPQ 
LSHGESMAQP 
GYLCEMEPVA 
PLECGQGFSQ 
GTACVGTELG 
QKAPGIPEAQ 



31 

1 

SSSRYNGPNL 
FLNTNNAFSG 
RQGERVPVPC 
VASLFAGRSV 
EAGVSKYTEG 
KEEAAALVEE 
RQAPQHYPVA 
ALSTTWPGG 
DGDPGRRRDS 
SCLRPLEAGT 
FPPGSSEEPL 
CDNENGPNFL 
CPWHARLLQA 
APCVLGSLIP 
VDIVYGNWNG 
lAYRSSSANR 
QKGRKDEDWA 
PITTRKR6YG 
GLQGPITTRK 
YQEKGLRGPI 
GSNHYQEKGIi 
KGATGSNVIR 
LSVFRGNQGF 
HFGLGKDEAS 
QENGHCMDTN 
SRHTMTWKPR 
VYEQDQE 



Seq ID NO: 474 DNA sequence 

Nucleic Acid Accession #: NM_003661.1 

Coding sequence: 1..1152 



ATGAGTGCAC 
CAAAACGTTC 
GCTGCTGGCA 
AAGGAAAAAG 
GGATTCGTGG 
GACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCCG 
ACACAAGCCC 
GAGTTTTTGG 
ACACGAGGCA 
GTACCGCATG 
GAACAGGTGG 
ACGGATGTGG 
TCAAAGCACT 
CAGGAGCTGG 
CAAGAACTGT 



11 
I 

TTTTCCTTGG 
CAAGTGGGAC 
CCATGGACCC 
TGAGCACACA 
CTGCTGCTGA 
CAAGACAAAT 
GGTTTCTGAA 
GTGCCCTTGC 
CTGGCTCTCT 
TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
TTGGGAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CGCCTGTAAG 
TACATGAGGG 
AGGAGAAGCT 
GA 



21 

I 

TGTGGGAGTG 
AGATACTGGA 
AGAGAGCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTCC 
AGGCAGCCTT 
GATTACCAGC 
CCTGGTCATC 
ATCCAACTTT 
CATCCGTGCC 
ACGCCCCCGG 
TGAACCCAGC 
CTTCTTTCTT 
GGCAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
ATCTTTATTG 
CTGCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGTiAGGTCC 
TCTGGCATCC 
GTACTCTTGG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTAG 
CTCAGACGAG 
GTCACTGAGC 
ATCCTGGAAA 
GTGCTGGATQ 
GAGACAGCTQ 
AACAATAATT 



41 

1 

VLKYDRAQKR 
HSSSAQVPSG 
CRGGLRPTHE 
ACVDRKGSGR 
FSHTASPSIG 
QREAGAAGVP 
PLVTQLMTHG 
LRSWEESRQK 
PKVTQECHLV 
VPGAALPGNP 
LQPPSGLRGS 
FHNRGDGTFV 
PHCHHGLSMS 
TAYYIVLWSA 
PHRLYLQMST 
LPRCSILARG 
RGCGNAGQSL 
VQSLPGKGAT 
RGYGLQSLPG 
TTRKRGYGLQ 
RGPITTRKRG 
RBHGDPLIEE 
NNNWLRWPR 
SVEVTWPDGK 
ECIQFPFVCP 
PKKELQLSQG 



41 
I 

AAGCTGGAGC 
GTAAGCCCCT 
AGGATGCCAT 
CTGATAATGA 
ATGAGCTCCG 
GGCACGATAA 
GTQAGCTTGA 
ACAAAGGCAC 
TGACCCTCGT 
AACCTGGGAT 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TGAGCAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



51 

I 

LVNIAVDERS 
LHRNRPVLKP 
PEPFLLRPKS 
YSIYIANYAY 
EISGRTEERB 
RGRVRTALQT 
RLAGKLARSV 
GQAMSRCALR 
ATMPALGGIiE 
GNWVLDMAKA 
PVLQVGLGXiA 
DAAASAERRL 
FTRTGSRFYS 
IPESLMTHSY 
HGKVRFRDIA 
SSSLTAGGRN 
AKEPASAIAG 
GSNHYQEKGIi 
KGATGSNHYH 
SLPGKGATGS 
YGLQSLPGKE 
LNPGDTU^EPE 
TRFGAFARGA 
MVSRNVASGE 
RDKPVCVNTY 
ICTPVWSFFL 



51 
I 

GAGGGTGCAA 
CGGTGACTGG 
TAAGTATTTC 
GGCCTGGAAC 
TAAAGCTCTG 
AGGCCAGCAG 
GGATAACATA 
CACCATCGCC 
CGGCATGGGT 
GGAGTTGGGA 
GAAGTGGTGG 
G6A6GTGAGG 
TTACCAACTC 
TCTTCAGTCA 
TGAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 475 Protein sequence 
Protein Accession #: NP_003652.1 

1 11 21 31 41 51 

| | | | | | 

MSALFLGVGV RAEEAGARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF 60 
KEKVSTQNLL LLLTDNEAWN GFVAAAELPR NEADELRKAL DNLARQMIMK DKNWHDKGQQ 120 
YRNWFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NVVSGSLSIS SGILTLVGMG 180 
LAPFTEGGSL VLLEPGMELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKEVR 240 
EFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG 300 
EQVERVNEPS ILEMSRGVKL TDVAPVSPFL VLDVVYLVYE SKHLHEGAKS ETAEELKKVA 360 
QELEEKLNIL NNNYKILQAD QEL 

Seq ID NO: 476 DNA sequence 
Nucleic Acid Accession #: NM_014452.1 
Coding sequence: 1..1968 



11 



21 



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10 



ATGGGGACCT 
GCCACAGCCA 
GCTCAGCCAG 
ACCGGCCAGG 
ACCAACACAA 
AATGGCATAG 
TTACCTTGTG 
AACGCTACCT 
ACAGAGACTG 
TCTAGTGTGA 
AAGCCGGGGA 
ACCTCACCTT 
GTCCCTTCCT 
TCTGTTAGAC 
TCAGCAAGGG 
CAGCAAGGCC 
GGCGAGAAGT 
CACAAGCATT 
GTGCTTGTGG 
CCCCGGCAGG 
ACCCAGAACC 
CTTGTAGCAG 
AGTGA6AGGG 
GCAGCTCTGC 
GCCCTGCGCC 
ACCACCCAGC 
AGCCCCATCC 
TCCCCACAGG 
GAGTCTACAT 
AAGAAGGACA 
GATGACATGC 
6CTGAGGACA 
CAGACCCTCC 



CTCCX3AGCAG 
CGATGATCGC 
AACAGAAGGC 
TGCTAACCTG 
GCCTGCGCGT 
AGAAATGCCA 
CTGCCTTGAC 
GTGCCCCCCA 
AGGATGTGCG 
TGAAATGCAA 
CCAAGGAGAC 
CCCCTGGCAC 
CCACTTATGT 
CAAAGGTACT 
GGAAGGAAGA 
CCCACCACAG 
CCAGCACGCC 
TTGACATCAA 
TGATTGTGGT 
ATCCCAGTGC 
GGGAGAAATG 
CCCAAGTGGG 
AGGTTGCTGC 
AGCACTGGAC 
AGCACCGGAG 
TGGAAACTGA 
CCAGCCCCAA 
ACAAGAACAA 
CCAGCGGCTC 
CAGTGTTGCG 
TCCACTTTCT 
AACTAGACCG 
TGGACTCTGT 



CAGCACCGCC 
GGGCTCCCTT 
CTCGAATCTC 
TGACAAGTGT 
CTGCAGCAGT 
TGACTGTAGT 
TGACCGAGAA 
TACGGTGTGT 
GTGTAAGCAG 
AGCATACACA 
AGACAACGTC 
AGCCATCTTT 
TCCCAAAGGC 
GAGTAGCATC 
CGTGAACAAG 
ACACATCCTG 
CATCAAGGGC 
TGAGCATTTG 
GTGCAGTATC 
CATTGTGGAA 
GATCTACTAC 
AAGCCA6TGG 
TTTCTCCAAT 
CATCCGGGGC 
AAACGATGTT 
CAAACTAGCT 
CGCGAAACTT 
GGGCTTCTTC 
CTCCGCGCTG 
GCAGGTACGC 
AAATCCTGAG 
GCTATTCGAA 
TTATAGCCAT 



CTCGCCTCCT 
CTCCTGCTTG 
ATTGGCACAT 
CCAGCAGGAA 
TGCCCTGTGG 
CAGCCATGCC 
TGCACTTGCC 
CCTGTGGGTT 
TGTGCTCGGG 
GACTGTCTGA 
TGTGGCACAC 
CCACGCCCTG 
ATGAACTCAA 
CAGGAAGGGA 
ACCCTCCCAA 
AAGCTGCTGC 
CCCAAGAGGG 
CCCTGGATGA 
CGGAAAAGCT 
AAGGCAGGGC 
TGCAATGGCC 
AAAGATATCT 
GGGTACACAG 
CCCGAGGCCA 
GTGGAGAAGA 
CTCCCGATGA 
GAGAATTCCG 
GTGGATGAGT 
AGCAGGAACG 
CTGGACCCCT 
GAGCTGCGGG 
ATTATTGGAG 
CTTCCTGACC 



GCAGCCGCAT 
GATTCCTTAG 
ACCGCCATGT 
CCTATGTCTC 
GGACCTTTAC 
CATGGCCAAT 
CACCTGGCAT 
GGGGTGTGCG 
GTACCTTCTC 
GTCAGAACCT 
TCCCGTCCTT 
AGCACATGGA 
CAGAATCCAA 
CAGTCCCTGA 
ACCTTCAGGT 
CGTCCATGGA 
GACATCCTAG 
TTGTGCTTTT 
CGAGGACTCT 
TGAAGAAATC 
ATGGTATCGA 
ATCAGTTTCT 
CCGACCACGA 
GCCTCGCCCA 
TTCGTGGGCT 
GCCCCAGCCC 
CTGTCCTGAC 
CGGAGCCCCT 
GTTCCTTTAT 
GTGACTTGCA 
TGATTGAAGA 
TCAAGAGCCA 
TGCTGTAG 



Seq ID NO: 477 Protein sequence 
Protein Accession #: NP_055267.1 



MGTSPSSSTA 
TGQVLTCDKC 
LPCAALTDRE 
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLWIWCSI 
LVAAQVGSQW 
ALRQHRRNDV 
SPQDKHKGFF 
DDMLHFLNPE 



11 
1 

LASCSRIARR 
PAGTYVSEHC 
CTCPPGMFQS 
DCLSQNLWI 
MNSTESNSSA 
KLLPSMEATG 
RKSSRTLKKG 
KDIYQFLCNA 
VEKIRGLMED 
VDESBPLLRC 
ELRVIBEIPQ 



21 

1 

ATATMIAGSL 
TNTSLRVCSS 
NATCAPHTVC 
KPGTKETDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE 
SERBVAAFSN 
TTQLETDKLA 
DSTSSGSSAL 
AEDKLDRLFE 



31 
1 

LLLGFLSTTT 
CPVGTFTRHB 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGLKKSMTP 
GYTADHERAY 
£}PMSPSPLSP 
SRNGSFITKE 
IIGVKSQKAS 



41 
1 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIF 
SARGKEDWK 
HKHFDINEHL 
TQNREKWIYY 
AALQHWTIR6 
SPIPSPNAKIi 
KKDTVLRQVR 
QTLLDSVYSH 



CGCCCGCCGA 

CACCACCACA 

TGACCGTGCC 

TGAGCATTGT 

CAGGCATGAG 

GATTGAGAAA 

GTTCCAGTCT 

GAAGAAAGGG 

AGATGTGCCT 

GGTGGTGATC 

CTCCAGCTCC 

AACCCATGAA 

CTCTTCTGCC 

CAACACAAGC 

AGTCAACCAC 

GGCCACTGGG 

ACAGAACCTA 

CCTGCTGCTG 

GAJ^AAAGGGG 

CATGACTCCA 

TATCCTGAAG 

TTGCAATGCC 

GCGGGCCTAC 

GCTMTTAGC 

GATGGAAGAC 

GCTTAGCCCG 

GGTGGAGCCT 

TCTCCGCTGT 

TACCAAAGAA 

GCCTATCTTT 

GATTCCCCAG 

GGAAGCCAGC 



51 

i 

IGTYRHVDRA 
QPCPWPMIEK 
CARGTFSDVP 
PRPEHMETHE 
TLPNLQWNH 
PWMIVLFLLL 
CNGHGIDILK 
PEASLAQLIS 
ENSALLTVEP 
LDPC33LQPIF 
LPDLL 



Seq ID NO: 478 DNA sequence 

Nucleic Acid Accession #: XM_044533 

Coding sequence: 238..2751 



11 



21 



31 



41 



51 



GCTCTGCCCA 
AGGGGCTGAG 
GGAGGCGGGG 
AGTCCGGCCG 
CTGCGCACCG 
CGGCCACCGC 
TGGGCGCTCA 
TTCGAAGCTG 
CTGTACGTGG 
GGCGGGGAGT 
TTCAAGGGCA 
AGCGGCAGTC 
AACATGGAGA 
AAGGGCCGTT 
CTCTACACTG 
AGCCTTCGCC 
GCCTCAGCCT 
TTTTTCTTCA 
ATTGCCCGCA 
TCCTTCCTCA 
CTGCAGGATG 
GGGGTCTTCA 
ACAATGAAGG 
CAGCAGTGGT 
AACAGTGCCC 
TTCCTCAAGG 
CCCCAGGCTC 
GATGTCCTCT 
CGGGTGCACA 



AGCCGAGGCT 
TTTGCCAGGG 
GCCCCCGGGG 
AGCCACCTGA 
CGATGGGCCT 
TGCTGCTGCT 
GCCCCCGGAT 
AACACATCTC 
GTGCTCGAGA 
ACCAGGAGCT 
AGGACCCACA 
ACCTGTTCAC 
ACTTCACCCT 
GTCCCTTCGA 
GAACAGTCAG 
CCACCAAGAC 
ACATTCCTGA 
GCGAGACTGG 
TCTGCAAGGG 
AGGCCCAGCT 
TCTTCACGCT 
CTTCCCAGTG 
ATGTGCAGAG 
ACACCGTGAC 
GGGAAAGGAA 
ACCACTTCCT 
GCTACCAGCG 
TCCTGGGCAC 
TCATTGAGGA 



GCGGGGCCGG 
CCCACTTGAC 
CGACTCGGGG 
GCCCGAGCCG 
GAGGAGCTGG 
CCTGCTGCTG 
CAGCCTGCCT 
CAACTACACA 
GGCCCTCTTT 
GCTTTGGGGT 
GCGCGACTGT 
CTGTGGCACA 
GGCAAGGGAC 
CCCGAATTTC 
CAGCTTCCAA 
CGAGAGCTCC 
GAGCCTGGGC 
CCAGGAATTT 
CGATGAGGGT 
GCTGTGCTCA 
GAGCCCCAGC 
GCACAGGGGA 
AGTCTTCAGC 
CCACCCGGTG 
GATCAACTCA 
6ATG6ACGGG 
CGTGGCTGTA 
TGGTGACGGC 
GCTGCA6ATC 



CGCCGGCGGG 
CCTGTTTCCC 
GCGGACCGCG 
CGGGACACCG 
CTCGCCGCCC 
CTGCTCCTGC 
CTGGGCTCTG 
GCCCTTCTGC 
GCACTCAGTA 
GCAGACGCAG 
CAAAACTACA 
GCAGCCTTCA 
GAGAAGGGGA 
AAGTCCACTG 
GGGAATGACC 
CTCAACTGGC 
AGCTTGCAAG 
GAGTTCTTTG 
GGAGAGCGGG 
CGGCCCGACG 
CCCCAGGACT 
ACTACAGAAG 
GGCCTCTACA 
CCCACACCCC 
TCCCTGCAGC 
CAGGTCCGAA 
CACCGCGTCC 
CGGCTCCACA 
TTCTCATCGG 



AGGACTGCGG 
ACCTCCCGCC 
GGGCGGAGCT 
TCGCTCCTGC 
CATGGGGCGC 
TGCAGCCGCC 
AAGAGCGGCC 
TGAGCAGGGA 
GCAACCTCAG 
AGAAGAAACA 
TCAAGATCCT 
GCCCCATGTG 
ATGTCCTCCT 
CCCTGGTGGT 
CGGCCATCTC 
TGCAAGACCC 
GCGATGATGA 
AGAACACCAT 
TGCTACAGCA 
ATGGCTTCCC 
GGCGTGACAC 
GCTCTGCCGT 
AGGAGGTGAA 
GGCCTGGAGC 
TCCCAGACCG 
GCCGCATGCT 
CTGGCCTGCA 
AGGCAGTGAG 
GACAGCCCGT 



TGCCCCGCGG 
CCCCAGGTCC 
GCCGCCCGTG 
TCTCCGAATG 
GCTGCCGCCT 
GCCTCCGACC 
ATTCCTCAGA 
TGGCAGGACC 
CTTCCTGCCA 
GCAGTGCAGC 
CCTGCCGCTC 
TACGTACATC 
GG7AGATGGC 
TGATGGCGAG 
GCGGAGCCAA 
AGCTTTTGTG 
CAAGATCTAC 
TGTGTCCCGC 
GCGCTGGACC 
CTTCAACGTG 
CCTTTTCTAT 
CTGTGTCTTC 
CCGTGAGACA 
GTGCATCACC 
CGTGCTGAAC 
GCTGCTGCAG 
CCACACCTAC 
CGTGGGCCCC 
GCAGAATCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 
CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 
TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 
ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 
TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 
CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 
TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 
GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 
GGCTTCCAGC AGCTG6TAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 
CA7U\CAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 
GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 
TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 
AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 
CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 
CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 
GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 
CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 
TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 
TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 
GGCCAGCTGG CGTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 
ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 
TGCTCCTTAT GTAAACTGAG CCCTTTiSTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 
GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 
GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 
TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 
CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 
CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 
GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 
GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 
CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGT^CCCGT GTGCCCCTTC 
CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 
GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 
TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO: 479 Protein sequence 
Protein Accession #: XP_044533.3 

1 11 21 31 41 51 

I I I I I I 

MLRTTWGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG EliYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWIiQDPAF 240 

VASAYIPBSIi GSLQGDDDKI YFFFSETGQE FEFFBNTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLP YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQI* ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNIjATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LPVLPLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 

Seq ID NO: 480 DNA sequence 

Nucleic Acid Accession #: NM_004217.1 

Coding sequence: 58..1092 

1 11 21 31 41 51 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AQATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CG6AGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CT6AAGCTTT TTGT 

Seq ID NO: 481 Protein sequence 
Protein Accession #: NP_004208 

1 11 21 31 41 51 
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WO 02/086443 

I 1 

MAQKENSYPW PYGRQTAPSG 
SSGTPDILTR HFTIDDFEIG 
QLRREIEIQA HLHHPNILRL 
MEELADALMY CHGKKVIHRD 
PPEMIEGRMH NEKVDLWCIG 
QDLISKLLRH NPSERIiPLAQ 






LSTLPQRVLR 
RPLGKGKFCTT 
YNYFYDRRRI 
IKPENLLLGL 
VLCYELLVGN 
VSAHPWVRAN 



KEPVTPSALV 
VYIiAREKKSH 
YLILEYAPRG 
KGELKIADFG 
PPFESASHNE 
SRRVLPPSAL 



1 

LMSRSNVQPT 
FIVALKVIjFK 

ELYKELQKSC 
WSVHAPSLRR 
TYRRIVKVDL 
QSVA 



AAPGQKVMEN 
SQIEKEGVEH 
TFDEQRTATI 
KTMCGTLDYL 
KFPASVPTGA 



Seq ID NO: 482 DNA sequence 
Nucleic Acid Accession #: AK055663 
Coding sequence: 38..1423 



1 
I 

AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAT^TG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
GATTTTTGTT 



11 
i 

CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TTTTTCTTTG 
AAATTTTGGA 
AAAGTTTCTC 



21 
1 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT 
CTW^CTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



31 
1 

CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTATW^GAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 

TAACTTAAGG 
ATGGAGTCTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
G7\ATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



41 

1 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCrCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 



51 

1 

ATCTCTTTCG 
TAGCAGCTGA 
CTGGCTTCCT 
ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TTGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AGCTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



Seq ID NO: 483 Protein sequence 
Protein Accession #: BAB70980.1 



1 
I 

MGTIHLFRKP 
TAYTYLTIFD 
ESAERFLEQP 
RSLCGIIPGL 
GTMYPMSVYS 
VRIRRDANEQ 
MPIiLKGTDDL 
SSMLNQGLGV 



11 
I 

QRSFFGKLLR 
LFSLMTCLIS 
EIHTGRLLVG 
SSIFLPRMNP 
GKVLLQTTPP 
MVLAHVTNRIj 
NPVTSTPAKP 
PGIGATQGLR 



21 

I 

EFRLV/VADRR 
YWVTLRKPSP 
TFVALCFNLF 
FVLIDLAGAF 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 

i 

SWKILLFGVI 
VYSFGFERLB 
TMLSIRNKPF 
ALCITYMLIE 
REVSTLDGVIi 
IFKDDWIRPA 
TPQKNVNPVI 
GTNNRIGQPR 



41 
I 

NLICTGFLLM 
VLAVFASTVIi 
AYVSEAASTS 
INNYFAVDTA 
EVRNBHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



51 

1 

WCSSTNSIAL 
AQLGALFILK 
WLQEHVADLS 
SAIAIALMTF 
GFGSLAGSVH 
LNFSDHHVIP 
FGLNHGHTPY 



Seq ID NO: 484 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..900 



ATGCCGCCGC 
CCGCGGCGGC 
GCC6TGGGCA 
CGGCCCACTG 
GGCTGCGGCG 
GGACCCCGGG 
CTTCCTAACT 
CCGGTGCGCA 
CTTTGCTACC 
TTTCAAAACA 
GTGCTGCTGG 



11 

1 

GGGAGCTGA6 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGCTGTGCA 
GAGGAGACTG 
CAGGCTCTCC 
TTGAGCTCTG 
CGGATACCGA 
TCACA6AGAA 
TGGGCACCCA 



21 

1 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGGT 
CCGGGGAGCT 
GAGCAGGCCC 
CCGCCCCGCC 
GGACACAGCG 
TGTCTTCCTG 
ATGGCTGCCC 
GGCCGACCTG 



31 

1 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
ACGTACGTTC 
GGGGCGGGCG 
CGAGGTGGCG 
CCTGCAGTGC 
GGACAGGAGG 
GCGTGCTTCA 
GAGATCCGCA 
AGGGACGATG 



41 

I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
AATCGCCCGT 
TCTCGGCGGG 
CTGGTGCGGC 
AAGTCCTGGT 
ATTTTGACCG 
GCGTGGTGCA 
CGCACAACCC 
TCAACGTACT 



51 

1 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
GCGGCCGCGT 
AGGGCGCAGA 
CCAGGACGCT 
GGATGGAGCT 
ACTTCGTTCC 
GCCCAGCTCC 
CCAG6CGCCT 
AATTCAGCT6 



60 
120 

180 
240 
300 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 




GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 

ATCCX5AGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACX3C AGAAGAACTT GAAGGAAGTA 

TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 

GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

Seq ID NO: 485 Protein sequence 
Protein Accession #: FGENESH predicted 






720 
780 
840 



1 11 21 31 41 51 

| | | | | | 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 
RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 
LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSVVQPSS 
FQNITEKWLP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 
IRACCYLECS ALTQKNLKEV FDSAILSAIE HKARLEKKLN AKGVRTLSRC RWKKFFCFV 



Seq ID NO: 486 DNA sequence 

Nucleic Acid Accession #: XM_063832.2 

Coding sequence: 1..711 



ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
ATTGAGCTCT 
CCGGATACCG 
ATCACAGAGA 
GTGGGCACCC 
GGCCGGGAGG 
TGCTGCTACC 
GCTATTCTCA 
GTGGGCACCC 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGACACAGC 
ATGTCTTCCT 
AATGGCTGCC 
AGGCCGACCT 
GCCCCGTGCC 
TTGAGTGCTC 
GTGCCATTGA 
TCTCCCGCTG 



21 
1 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGTG 
GGGACAGGAG 
GGCGTGCTTC 
CGAGATCCGC 
GAGGGACGAT 
CCAACCCCAG 
AGCCTTGACG 
GCACAAAGCC 
CCGCTGGAAG 



31 

I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
CAAGTCCTGG 
GATTTTGACC 
AGCGTGGTGC 
ACGCACAACC 
GTCAACGTAC 
GCTCAGGGTC 
CAGAAGAACT 
CGGCTGGAGA 
AAGTTCTTCT 



41 

1 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
TGGATGGAGC 
GACTTCGTTC 
AGCCCAGCTC 
CCCAGGCGCC 
TAATTCAGCT 
TGGCCGAGAA 
TGAAGGAAGT 
AGAAACTGAA 
GCTTCGTTTG 



51 

1 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
TCCGGTGCGC 
CCTTTGCTAC 
CTTTCAAAAC 
TGTGCTGCTG 
GGACCAGGGG 
GATCCGAGCC 
ATTTGACTOG 
TGCCAAAGGT 
A 



Seq ID NO: 487 Protein sequence 
Protein Accession #: XP_063832.1 

1 11 21 31 41 51 

I I I I I I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 
RPTALDTFSV QVLVDGAPVR lELWDTAGQE DFDRLRSLCY PDTDVFLACF SWQPSSFQN 
ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV 

Seq ID NO: 488 DNA sequence 

Nucleic Acid Accession #: NM_014398.1 

Coding sequence: 64..1314 



1          11         21         31         41         51
|          |          |          |          |          |

GGCACCGATT CGGGGCCTGC CCGGACTTCG CCGCACGCTG CAGAACCTCG CCCAGCGCCC 60
ACCATGCCCC GGCAGCTCAG CGCGGCGGCC GCGCTCTTCG CGTCCCTGGC CGTAATTTTG 120
CACGATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180
ACTGCAGCAG CAACAGTACA GGACATAAAA AAACCTGTCC AGCAACCAGC TAAGCAAGCA 240
CCTCACCAAA CTTTAGCAGC AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 300
ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGCAAC CACCAGCCCA 360
ATTACCTACA CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420
GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCCACC CACCATCACC 480
CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC TGGGAACACC 540
ACTCAACCCA GTAACCAGAC CACCCTTCCA GCAACTTTAT CGATAGCACT GCACAAAAGC 600
ACAACCGGTC AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC AGCTGCCCAC 660
AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 720
CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT CTGTATAAAA 780
GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTTTTTC ACCTCGGAGA 840
TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTGTGGCAC CCGAAAATCC 900
AACCTTCTGT TGAATTTTCA GGGCGGATTT GTGAATCTCA CATTTACCAA GGATGAAGAA 960
TCATATTATA TCAGTGAAGT GGGAGCCTAT TTGACCGTCT CAGATCCAGA GACAGTTTAC 1020
CAAGGAATCA T^CATGCGGT GGTGATGTTC CAGACAGCAG TCGGGCATTC CTTCAAGTGC 1080
GTGAGTGAAC AGAGCCTCCA GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 1140
CTTCAAGCCT TTGATTTTGA AGATGACCAC TTTGGAAATG TGGATGAGTG CTCGTCTGAC 1200
TACACAATTG TGCTTCCTGT GATTGGGGCC ATCGTGGTTG GTCTCTGCCT TATGGGTATG 1260
GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT CTAATTGTTG 1320
CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT CATCCCTTCC AGGATGGATG 1380
TTGGGAAATT CCCTCAGAGT GTGGGTCCTT CAAACAATGT AAACCACCAT CTTCTATTCA 1440
AATGAAGTGA GTCATGTGTG ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT 1500
GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA GTTTCCTTTA GAATATTTTA 1560
GCCACTCAAA GTCAACATTT GAGATATGTT GAATTAACAT TVATATATGTA AAGTAGAATA 1620
AGCCTTCAAA TTATAAACCA AGGGTCAATT GTAACTAATA CTACTGTGTG TGCATTGAAG 1680
ATTTTATTTT ACCCTTGATC TTAACAAAGC CTTTGCTTTG TTATCAAATG GACTTTCAGT 1740
GCTTTTACTA TCTGTGTTTT ATGGTTTCAT GTAACATACA TATTCCTGGT GTAGCACTTA 1800
ACTCCTTTTC CACTTTAAAT TTGTTTTTGT TTTTTGAGAC GGAGTTTCAC TCTTGTCACC 1860
CAGGCTGGAG TACAGTGGCA CGATCTCGGC TTATGGCAAC CTCCGCCTCC CGGGTTCAAG 1920
TGATTCTCCT GCTTCAGCTT CCCGAGTAGC TGGGATTACA GGCACACACT ACCACGCCTG 1980
GCTAATTTTT GTATTTTTAT TATAGACGGG TTTCACCATG TTGGCCAGAC TGGTCTTGAA 2040
CTCTTGACCT CAGGTGATCC ACCCACCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCATG 2100
AGCCATTGCG CCCGGCCTTA AATGTTTTTT TTAATCATCA AAAAGAACAA CATATCTCAG 2160



WO 02/086443 

GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA AATCAGCTTA TATTTTTTAT 2220 

CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA GGTGGCTACA GATGGTAGAA 2280

CTAAACAATA AGCAAGAGAC AATAATAATG GCCCTTAATT ATTAACAAAG TGCCAGAGTC 2340 

TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAAGT GAATGAGTAA 2400

ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG GCTAACTGAT GGCAGAGCCA 2460 

GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 2520 

CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA CACCTCATAC CAGCATACGC 2580 

TCACCTTACA GGGAT^TGGG TTTATCCAGG ATCATGAGAC ATTAGGGTAG ATGAAAGGAG 2640 

AGCTTTGCAG ATAACAAAAT AGCCTATCCT TAATAAATCC TCCACTCTCT GGAAGGAGAC 2700

TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTA TGGGATTGCT TAGCTGGGCT 2760 

GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTGA TAATAGAGAA 2820 

ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC ATCTTGTGGT AACTTGCTGC 2880 

TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 2940 

GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATGTTAAC 3000 

AGA/y^GTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 3060 

CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC AATAAAAACT TACTTTGG 



PCT/US02/12476 



Seq ID NO: 489 Protein sequence 
Protein Accession #: NP_055213.1

1          11         21         31         41         51
|          |          |          |          |          |

MPRQLSAAAA LFASLAVILH DGSQMRAKAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP 60
HQTLAARFMD GHITFQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120
TEVTVGPSLA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIALHKST 180
TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTLAPQP SSVKTGIYQV LNGSRLCIKA 240
EMGIQLIVQD KESVFSPRRY FNIDPNATQA SGNCGTRKSN LLLNFQGGFV NLTFTKDEES 300
YYISEVGAYL TVSDPETVYQ GIKHAVVMFQ TAVGHSFKCV SEQSLQLSAH LQVKTTDVQL 360
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI VVGLCLMGMG VYKIRLRCQS SGYQRI

Seq ID NO: 490 DNA sequence

Nucleic Acid Accession #: NM_005409.3

Coding sequence: 94..378

1          11         21         31         41         51
|          |          |          |          |          |

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60
CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300
AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360
GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480
AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660
GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720
GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAi^ATACAC AGTTCTTTCC 960
CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020
TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140
GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260
TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA

Seq ID NO: 491 Protein sequence
Protein Accession #: NP_005400.1

1          11         21         31         41         51
|          |          |          |          |          |

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF

Seq ID NO: 492 DNA sequence

Nucleic Acid Accession #: NM_000577.1

Coding sequence: 41..520

1          11         21         31         41         51
|          |          |          |          |          |

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60
CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120
GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180
CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240
CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300
GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360
CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420
CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480
CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540
TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 600

GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660 

CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720

GTCTTTCTAA TGTGTGT^TC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780

TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 840 

CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT CAGTCCCCGT 1020 

GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 1080 

ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 1260 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 1380

TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440

ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 
AT^TUVAAAAAA J\AAAAAAA7\A AAAAAAAAAA AAAAAA 

Seq ID NO: 493 Protein sequence 
Protein Accession #: NP_000568.1 

1 11 21 31 41 51 

| | | | | |

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 

HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 120 
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE 

Seq ID NO: 494 DNA sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence: 222.. 1898 

1 11 21 31 41 51 

| | | | | |

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 

GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 360

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 480

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840

GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900

TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 1200

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 1680

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400

TCTGAGATGA TGCATGATGC CCTGCCCTGA GCGCAGGCTG CAGAGGCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGG6A AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2760 

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 




CTGGACX3GGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3660
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 






Seq ID NO: 495 Protein sequence 
Protein Accession #: NP_002072.1

1          11         21         31         41         51
|          |          |          |          |          |

MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 60
ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 120
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 180
QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDVVRKVAQV 240
PLGPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300
TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 360
RGKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKMALSTA SDDRCWNGMA 420
RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480
ASDDGSGSGS



Seq ID NO: 496 DNA sequence 

Nucleic Acid Accession #: NM_001650.2

Coding sequence: 40..1011

1          11         21         31         41         51
|          |          |          |          |          |

GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 60
AGGCGGTGGG GTAAGTGTGG ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 120
GGGGTCTGGA CTCAAGCTTT CTGGAAGGCA GTCACAGCGG AATTTCTGGC CATGCTTATT 180
[illegible] TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA GCCTTTACCG 240
GTCGACATGG TTCTCATCTC CCTTTGCTTT GGACTCAGCA TTGCAACCAT GGTGCAGTGC 300
TTTGGCCATA TCAGCGGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 360
AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG CCCAGTGCCT GGGGGCCATC 420
ATTGGAGCAG GAATCCTCTA TCTGGTCACA CCTCCCAGTG TGGTGGGAGG CCTGGGAGTC 480
ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC TGGTTGAGTT GATAATCACA 540
TTTCAATTGG TGTTTACTAT CTTTGCCAGC TGTGATTCCA AACGGACTGA TGTCACTGGC 600
TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC ATTTATTTGC AATCAATTAT 660
ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG CAGTTATCAT GGGAAATTGG 720
GAAAACCATT GGATATATTG GGTTGGGCCC ATCATAGGAG CTGTCCTCGC TGGTGGCCTT 780
TATGAGTATG TCTTCTGTCC AGATGTTGAA TTCAAACGTC GTTTTAAAGA AGCCTTCAGC 840
AAAGCTGCCC AGCAAACAAA AGGAAGCTAC ATGGAGGTGG AGGACAACAG GAGTCAGGTA 900
GAGACGGATG ACCTGATTCT AAAACCTGGA GTGGTGCATG TGATTGACGT TGACCGGGGA 960
GAGGAGAAGA AGGGGAAAGA CCAATCTGGA GAGGTATTGT CTTCAGTATG ACTAGAAGAT 1020
CGCACTGAAA GCAGACAAGA CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 1080
GAAACAGATT TGTTATAAAT TAGAAATGTG CAGGTTTGTT GTTTCATGTC ATATTACTCA 1140
GTCTAAACAA TAAATATTTC ATAATTTACA AAGGAGGAAC GGAAGAAACC TATTGTGAAT 1200
TCCAAATCTA AAAAAAGAAA TATTTTTAAG ATGTTCTTAA GCAAATATAT ACCTATTTTA 1260
TCTAGTTACC TTTCATTAAC AACCAATTTT AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320
CCTGACAGAA CTCAAAGACA CGTCTATCAG CTTATTCCTT CTCTACTGGA ATATTGGTAT 1380
AGTCAATTCT TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C



Seq ID NO: 497 Protein sequence 
Protein Accession #: NP_001641.1

1          11         21         31         41         51
|          |          |          |          |          |

MSDRPTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA EFLAMLIFVL LSLGSTINWG 60
GTEKPLPVDM VLISLCFGLS IATMVQCFGH ISGGHINPAV TVAMVCTRKI SIAKSVFYIA 120
AQCLGAIIGA GILYLVTPPS VVGGLGVTMV HGNLTAGHGL LVELIITFQL VFTIFASCDS 180
KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG 240
AVLAGGLYEY VFCPDVEFKR RFKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGVVH 300
VIDVDRGEEK KGKDQSGEVL SSV

Seq ID NO: 498 DNA sequence
Nucleic Acid Accession #: AB020684.
Coding sequence: 1..1744

1          11         21         31         41         51
|          |          |          |          |          |

CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 60
TTGGTACCGG ATTTATACCA AAATAATGGA CTTGATTGGT ATTCAAACCA AGATATGTTG 120
GACGGTTACC AGAGGAGAAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 180
TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240
CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 300
CTTTTTCAAT CATGGAGAGT GTACCCGTGT AATGTGGACA CCACCTCTCC GTGAAAGCTT 360
CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420
ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 480
GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540
CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600
ACTTTGTTTT GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660
TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720
TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840
ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900
TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960
TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020
TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080
ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140
GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200
CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260
AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320
AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 1380
CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440
TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500
AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560
AGAGTCATGG TGTGTAAGAA GATCCAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620
AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680
ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740
ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800
GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860
AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 1920
CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980
TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040
GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100
TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160
TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220
ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280
CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340
CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATTATC 2400
GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCCCATA TTTTCTGTCA 2460
AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT TATTCTTGGT TTTAAAAAAA 2520
ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 2580
CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACTATTTTGA 2640
ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700
TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATAAAACA 2760
TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2820
CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880
TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2940
TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGi\ATT CCTTGTATGA 3000
ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAACG ATGAAACCTC 3060
ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 3120
AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180
TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCCCACT CCTGGAAGCC 3240
GCTGTTCCCT TTCTACCTTC TTACCAG7W3C CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 3300
CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360
CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGAGG 3420
CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480
CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3540
TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600
CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660
TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720
ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780
TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3840
CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 3900
GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960
TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020
TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080
TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140
AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200
TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260
AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320
GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG



Seq ID NO: 499 Protein sequence
Protein Accession #: BAA74900.1

1          11         21         31         41         51
|          |          |          |          |          |

PLVINTLKRF NLYPEVILAS WYRIYTKIMD LIGIQTKICW TVTRGEGLSP IESCEGLGDP 60
ACFYVAVIFI LNGLMMALFF IYGTYLSGSR LGGLVTVLCF FFNHGECTRV MWTPPLRESF 120
SYPFLVLQML LVTHILRATK LYRGSLIALC ISNVFFMLPW QFAQFVLLTQ IASLFAVYVV 180
GYIDICKLRK IIYIHMISLA LCFVLMFGNS MLLTSYYASS LVIIWGILAM KPHFLKINVS 240
ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFFSYKDFD TLLYTCAAEF 300
DFMEKETPLR YTKTLLLPVV LVVFVAIVRK IISDMWGVLA KQQTHVRKHQ FDHGELVYHA 360
LQLLAYTALG ILIMRLKLFL TPHMCVMASL ICSRQLFGWL FCKVHPGAIV FAILAAMSIQ 420
GSANLQTQWN IVGEFSNLPQ EELIEWIKYS TKPDAVFAGA MPTMASVKLS ALRPIVNHPH 480
YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV 540
EDPANAGKTP LCNLLVKDSK PHFTTVFQNS VYKVLEVVKE

Seq ID NO: 500 DNA sequence

Nucleic Acid Accession #: NM_001276.1

Coding sequence: 127..1278

1          11         21         31         41         51
|          |          |          |          |          |

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 60
TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 120

GCCAGAATGG GTGTGAAGGC GTCTCAAACA GGCTTTGTGG TCCTGGTGCT GCTCCAGTGC 180
TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTGGT CCCAGTACCG GGAAGGCGAT 240
GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA CCCACATCAT CTACAGCTTT 300
GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA ATGATGTGAC GCTCTACGGC 360
ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCTTGTC TGTCGGAGGA 420
TGGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA ACACCCAGAG TCGCCGGACT 480
TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT TTGATGGGCT GGACCTTGCC 540
TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA CCCTAATCAA GGAAATGAAG 600
GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC TCCTGCTCAG CGCAGCACTG 660
TCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG CCAAGATATC CCAACACCTG 720
GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT GGCGTGGGAC CACAGGCCAT 780
CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG ACAGATTCAG CAACACTGAC 840
TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA GTAAGCTGGT GATGGGCATC 900
CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA CTGGTGTTGG AGCCCCAATC 960
TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG GGACCCTTGC CTACTATGAG 1020
ATCTGTGACT TCCTCCGCGG AGCCACAGTC CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080
GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG AAAGCGTCAA AAGCAAGGTG 1140
CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT GGGCCCTGGA CCTGGATGAC 1200
TTCCAGGGCT CCTTCTGCGG CCAGGATCTG CGCTTCCCTC TCACCAATGC CATCAAGGAT 1260
GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC GGGGGCCAAG GATGCCCCGT 1320
CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380
GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC AACACACAGA TTTGAGCTCA 1440
GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT AGTGAGGCAT CGCAATGTAA 1500
GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA TGTTTACAGA TCCCCAAGCC 1560
TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC CCTAGCCCTC GTTATCAAAG GACACCATTT 1620
TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA CACAGTGACC ATACTAATTA 1680
TACCCCCTGC 7\AAGCCAGCT TGA/iACCTTC ACTTAGGAAC GTAATCGTGT CCCCTATCCT 1740
ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA AGAGTTTAAC AGTGTGTTGG 1800
CGCTTTGCTT TGGTCTATCT TTGAGCGCCC ACTAGACCCA CTGGACTCAC CTCCCCCATC 1860
TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC TTGCAGAGAT GAAGGCCGCC 1920
ATGTT



Seq ID NO: 501 Protein sequence 
Protein Accession #: NP_001267.1 



1          11         21         31         41         51
|          |          |          |          |          |

MGVKASQTGF VVLVLLQCCS AYKLVCYYTS WSQYREGDGS CFPDALDRFL CTHIIYSFAN 60
ISNDHIDTWE WNDVTLYGML NTLKNRNPNL KTLLSVGGWN FGSQRFSKIA SNTQSRRTFI 120
KSVPPFLRTH GFDGLDLAWL YPGRRDKQHF TTLIKEMKAE FIKEAQPGKK QLLLSAALSA 180
GKVTIDSSYD IAKISQHLDF ISIMTYDFHG AWRGTTGHHS PLFRGQEDAS PDRFSNTDYA 240
VGYMLRLGAP ASKLVMGIPT FGRSFTLASS ETGVGAPISG PGIPGRFTKE AGTLAYYEIC 300
DFLRGATVHR TLGQQVPYAT KGNQWVGYDD QESVKSKVQY LKDRQLAGAM VWALDLDDFQ 360
GSFCGQDLRF PLTNAIKDAL AAT

Seq ID NO: 502 DNA sequence

Nucleic Acid Accession #: NM_006474.

Coding sequence: 181..669

1          11         21         31         41         51
|          |          |          |          |          |

GCTGCCTAGG GTCTGGAAAG CTCGGGCACC CTCCCTCTCC GGGGCTCCTG CTCCCACCCC 60
TCCGGCCCCC CCACCGTCGC GCTCCTCCAG GCTGGGCCTG TGGCCGCGGT GCTTTTAATT 120
TTCCCCCAGC TCAGAATCTT GCTGCTCGGC CCCCAGGAGA GCAACAACTC TiACGGGAACG 180
ATGTGGAAGG TGTCAGCTCT GCTCTTCGTT TTGGGAAGCG CGTCGCTCTG GGTCCTGGCA 240
GAAGGAGCCA GCACAGGCCA GCCAGAAGAT GACACTGAGA CTACAGGTTT GGAAGGCGGC 300
GTTGCCATGC CAGGTGCCGA AGATGATGTG GTGACTCCAG GAACCAGCGA AGACCGCTAT 360
AAGTCTGGCT TGACAACTCT GGTGGCAACA AGTGTCAACA GTGTAACAGG CATTCGCATC 420
GAGGATCTGC CAACTTCAGA AAGCACAGTC CACGCGCAAG AACAAAGTCC AAGCGCCACA 480
GCCTCAAACG TGGCCACCAG TCACTCCACG GAGAAAGTGG ATGGAGACAC ACAGACAACA 540
GTTGAGAAAG ATGGTTTGTC AACAGTGACC CTGGTTGGAA TCATAGTTGG GGTCTTACTA 600
GCCATCGGTT TCATTGGTGG AATCATCGTT GTGGTTATGC GAAAAATGTC GGGAAGGTAC 660
TCGCCCTAAA GAGCTGAAGG GTTACGCCCT GCTTGCCAAC GTGCTTTAAA AAAAGACCGT 720
TTCTGACTCT GTGGCCCTGT CCCTGAGCTC GTGGGGAGAA GATGACCCTG GGAACATTTG 780
CGGGCCCATT CAGATTCCAC GGTGACTTTC CGTTTGCCAA ATTAACCGAG GAAAGACCTT 840
TCACCAGATT TGGTTCTTAA ACTTT

Seq ID NO: 503 Protein sequence
Protein Accession #: NP_006465.1

1          11         21         31         41         51
|          |          |          |          |          |

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 60
KSGLTTLVAT SVNSVTGIRI EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 120
VEKDGLSTVT LVGIIVGVLL AIGFIGGIIV VVMRKMSGRY SP

Seq ID NO: 504 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 62..895

1          11         21         31         41         51
|          |          |          |          |          |

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA CAACTGACGA 60
TATGATCATC TTAATTTACT TATTTCTCTT GCTATGGGAA GACACTCAAG GATGGGGATT 120
CAAGGATGGA ATTTTTCATA ACTCCATATG GCTTGAACGA GCAGCCGGTG TGTACCACAG 180
AGAAGCACGG TCTGGCAAAT ACAAGCTCAC CTACGCAGAA GCTAAGGCGG TGTGTGAATT 240
TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA TTGGATTTCA 300

TGTCTGTGCT GCTGGATGGA TGGCTAAGGG CAGAGTTGGA TACCCCATTG TGAAGCCAGG 360 

GCCCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCGTC TCAATAGGAG 420

TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG GAGTGTGGTG GCGTCTTTAC 480 

AGATCCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT GAGTACGAAG ATAACCAAAT 540 

CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGCGTATT CACCTGAGTT TTTTAGATTT 600

TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 660

TGATGTCCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACATCATCAG 720

TACAGGAAAT GTCATGACCT TGAAGTTTCT AAGTGATGCT TCAGTGACAG CTGGAGGTTT 780

CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 840

TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA TTTAGCCACT TATAAAAAAA 900 

AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT 960 

CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA TGTGAAAGCA ATACATAATT 1020 

TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA ATGAAACCTC TCATAATCCC 1080 

ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT TTCTTTCAGT CATTTTTCTA 1140 

TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GGAATCCTGC 1200 

TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC TTTATAAACA TTTTCTGAAA 1260 

TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT AAAATATTCT ATGATATGAA 1320 

TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT TTCAGGTCAT TTTCATAAAT 1380 
ATTGTTGCAA TAAATATCCT TGAACACACA AAAAAAAAAA AA 

Seq ID NO: 505 Protein sequence
Protein Accession #: Eos sequence

1          11         21         31         41         51
|          |          |          |          |          |

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 120
ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL

Seq ID NO: 506 DNA sequence

Nucleic Acid Accession #: NM_007115.1

Coding sequence: 69..902

1          11         21         31         41         51
|          |          |          |          |          |

GAATTCGCAC TGCTCTGAGA ATTTGTGAGC AGCCCCTAAC AGGCTGTTAC TTCACTACAA 60
CTGACGATAT GATCATCTTA ATTTACTTAT TTCTCTTGCT ATGGGAAGAC ACTCAAGGAT 120
GGGGATTCAA GGATGGAATT TTTCATAACT CCATATGGCT TGAACGAGCA GCCGGTGTGT 180
ACCACAGAGA AGCACGGTCT GGCAAATACA AGCTCACCTA CGCAGAAGCT AAGGCGGTGT 240
GTGAATTTGA AGGCGGCCAT CTCGCAACTT ACAAGCAGCT AGAGGCAGCC AGAAAAATTG 300
GATTTCATGT CTGTGCTGCT GGATGGATGG CTAAGGGCAG AGTTGGATAC CCCATTGTGA 360
AGCCAGGGCC CAACTGATGA TTTGGAAAAA CTGGCATTAT TGATTATGGA ATCCGTCTCA 420
ATAGGAGTGA AAGATGGGAT GCCTATTGCT ACAACCCACA CGCAAAGGAG TGTGGTGGCG 480
TCTTTACAGA TCCAAAGCGA ATTTTTAAAT CTCCAGGCTT CCCAAATGAG TACGAAGATA 540
ACCAAATCTG CTACTGGCAC ATTAGACTCA AGTATGGTCA GCGTATTCAC CTGAGTTTTT 600
TAGATTTTGA CCTTGAAGAT GACCCAGGTT GCTTGGCTGA TTATGTTGAA ATATATGACA 660
GTTACGATGA TGTCCATGGC TTTGTGGGAA GATACTGTGG AGATGAGCTT CCAGATGACA 720
TCATCAGTAC AGGAAATGTC ATGACCTTGA AGTTTCTAAG TGATGCTTCA GTGACAGCTG 780
GAGGTTTCCA AATCAAATAT GTTGCAATGG ATCCTGTATC CAAATCCAGT CAAGGAAAAA 840
ATACAAGTAC TACTTCTACT GGAAATAAAA ACTTTTTAGC TGGAAGATTT AGCCACTTAT 900
AAAAAAAAAA AAGGATGATC AAAACACACA GTGTTTATGT TGGAATCTTT TGGAACTCCT 960
TTGATCTCAC TGTTATTATT AACATTTATT TATTATTTTT CTAAATGTGA TiAGAAATACA 1020
TAATTTAGGG AAAATTGGAA AATATAGGAA ACTTTAAACG AGAAAATGAA ACCTCTCATA 1080
ATCCCACTGC ATAGAAATAA CAAGCGTTAA CATTTTCATA TTTTTTTCTT TCAGTCATTT 1140
TTGTATTTGT GGTATATGTA TATATGTACC TATATGTATT TGCATTTGAA ATTTTGGAAT 1200
CCTGCTCTAT GTACAGTTTT GTATTATACT TTTTAAATCT TGAACTTTAT GAACATTTTC 1260
TGAAATCATT GATTATTCTA CAAAAACATG ATTTTAAACA GCTGTAAAAT ATTCTATGAT 1320
ATGAATGTTT TATGCATTAT TTAAGCCTGT CTCTATTGTT GGAATTTCAG GTCATTTTCA 1380
TAAATATTGT TGCAATAAAT ATCCTTCGGA ATTC



Seq ID NO: 507 Protein sequence
Protein Accession #: NP_009046.1

1 11 21 31 41 51
| | | | | |
MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 120
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL



Seq ID NO: 508 DNA sequence
Nucleic Acid Accession #: NM_001044.1
Coding sequence: 129..1991

1 11 21 31 41 51
| | | | | |
ACCGCTCCGG AGCGGGAGGG GAGGCTTCGC GGAACGCTCT CGGCGCCAGG ACTCGCGTGC 60
AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 120
GTGTGCCCAT GAGTAAGAGC AAATGCTCCG TGGGACTCAT GTCTTCCGTG GTGGCCCCGG 180
CTAAGGAGCC CAATGCCGTG GGCCCGAAGG AGGTGGAGCT CATCCTTGTC AAGGAGCAGA 240
ACGGAGTGCA GCTCACCAGC TCCACCCTCA CCAACCCGCG GCAGAGCCCC GTGGAGGCCC 300
AGGATCGGGA GACCTGGGGC AAGAAGATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360
TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATGGT GGCGGTGCCT 420




TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 600 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 660 

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCCGCGGT 840 

GGCAGCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 1020

TGAGCGTTGA CTTCTACCGG CTCTGCGAGG CGTCTGTTTG GATTGACGCG GCCACCCAGG 1080

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200 

TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT CTTCATCATG CTGCTCACCC 1380 

TGGGTATCGA CAGCGCCATG GGTGGTATGG AGTCAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500 

TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 1620 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACCGGGCAG CGGCCCAGCC 1680 

TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACCCG 1920 

AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC 2100

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCCGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 

GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGACGC ACCTGCTGAG AATCCCCGTG 2400 

CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 

GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520

GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTGA GTTGACACAT TGCACACACA 2580

GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 2640 

ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 2700 

GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG 2760 

GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2820 

AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880 

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940 

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060 

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120 

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCGTGCCA 3240 

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300

ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 3360 

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 3420 

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480 

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540 

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA ATGCTCGGTG GGAGCCGTCA 3600

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3660 

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720 

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780 

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840 

AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3900
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 

Seq ID NO: 509 Protein sequence
Protein Accession #: NP_001035.1

1 11 21 31 41 51
| | | | | |
MSKSKCSVGL MSSVVAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60
ETWGKKIDFL LSVIGFAVDL ANVWRFPYLC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 120
GQFNREGAAG VWKICPILKG VGFTVILISL YVGFFYNVII AWALHYLFSS FTTELPWIHC 180
NNSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYFERG VLHLHQSHGI DDLGPPRWQL 240
TACLVLVIVL LYFSLWKGVK TSGKVVWITA TMPYVVLTAL LLRGVTLPGA IDGIRAYLSV 300
DFYRLCEASV WIDAATQVCF SLGVGFGVLI AFSSYNKFTN NCYRDAIVTT SINSLTSFSS 360
GFVVFSFLGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAVV FFIMLLTLGI 420
DSAMGGMESV ITGLIDEFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 480
GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFVVVVSI 540
VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVPIYAAYKF CSLPGSFREK LAYAIAPEKD 600
RELVDRGEVR QFTLRHWLKV

Seq ID NO: 510 DNA sequence
Nucleic Acid Accession #: NM_001216.1
Coding sequence: 43..1422

1 11 21 31 41 51
| | | | | |
GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240
AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300
GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360
TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420
AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480
CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540
CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600
CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780
CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960
CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020
TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080
CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140
GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200
GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260
AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320
ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380
GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440
TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT

Seq ID NO: 511 Protein sequence
Protein Accession #: NP_001207.1

1 11 21 31 41 51
| | | | | |
MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60
GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120
DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180
ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240
VEGHRFPAEI HVVHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300
EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360
DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420
GLLFAVTSVA FLVQMRRQHR RGTKGQVSYR PAEVAETGA



Seq ID NO: 512 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..3978

1 11 21 31 41 51
| | | | | |
ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GCGGAGATCC 60
TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGGAAGG 120
TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180
ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 240
TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300
GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360
ACACGCGTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420
CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480
GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540
GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600
TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 660
CTCAATATAC TGTCT^AGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720
CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780
CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840
GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900
ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960
TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020
TTTGTCCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080
ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140
ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200
ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260
CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320
TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380
AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440
GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500
TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560
GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620
AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680
GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740
CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800
ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860
CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920
CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980
AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040
CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100
TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160
CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220
AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280
CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340
TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400
CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460
CTGGGTCTCT GGTTGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520
ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580
ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640
ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700
CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760
ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820
ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880
GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940
CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000
CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060
TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120
CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180
TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240
CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300
CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360
AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420
AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480
ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540
TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600
ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660
GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720
TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780
GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840
CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3900
TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3960
GCAGAAGTCA GATTGTAG

Seq ID NO: 513 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MVGEGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL 60
TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHVVWKFQR 120
TRVLMDIVAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCIALF ATEFTKVFFW 180
ALAWAINYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL 240
PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAFRRS AILVTDKRVQ 300
TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVL 360
TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 420
PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK 480
GATGPEEQSD SLKSVLHSIS FVVRKLCRYP EAQLLAWRWP AVFVGRIIRG YRPHGFSAKD 540
KDESRRLLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGVVAVNG 600
TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 660
NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTVVLVTHQ 720
LQFLESCDEV ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 780
FKESPAEREE DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 840
MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS 900
PMSFFDTTPT GRLMNRFSKD MDELDVRLPF HAENFLQQFF MVVFILVILA AVFPAVLLVV 960
ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGIIHAYG KKESCITYTS 1020
SKGLSLSYII QLSGLLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW 1080
PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA 1140
SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF 1200
MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKIILLDEAT ASMDSKTDTL 1260
VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA 1320
AEVRL

Seq ID NO: 514 DNA sequence
Nucleic Acid Accession #: Z31560
Coding sequence: 1..966

1 11 21 31 41 51
| | | | | |
CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60
ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120
AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180
CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240
GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300
CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360
AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420
AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480
ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540
CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600
ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660
ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720
CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780
TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840
TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGTCCCAG 900
CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960
ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020
TGGGAGGGGT GCAAAAGAGG AGAGTT^AGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080
AAAAA

Seq ID NO: 515 Protein sequence
Protein Accession #: CAA83435

1 11 21 31 41 51
| | | | | |
HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60
RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120
KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180
QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240
LGSMGSVVKS EASSSPPVVT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300
HYQSGPVPGT AINGTLPLSH M

Seq ID NO: 516 DNA sequence
Nucleic Acid Accession #: U91618
Coding sequence: 29..541

1 11 21 31 41 51
| | | | | |
CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60
CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120
AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180
TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240
AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300
TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360
TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420
TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480
GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540
AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600
ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660
ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720
TCTTCAAAAA AAAAAAAAAA A7WVTGGGGCC GCAATT



Seq ID NO: 517 Protein sequence
Protein Accession #: AAB50564

1 11 21 31 41 51
| | | | | |
MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60
VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 120
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY



Seq ID NO: 518 DNA sequence
Nucleic Acid Accession #: NM_006536.2
Coding sequence: 109..2940

1 11 21 31 41 51
| | | | | |
ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60
ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120
AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180
GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240
ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300
ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360
ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420
TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480
TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540
TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600
GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660
ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720
GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780
GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840
AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900
CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960
TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020
GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080
GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140
ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200
CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260
TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320
AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380
CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440
CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500
TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560
TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG T^GTACAGG TGAAAATGTC 1620
AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680
ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740
GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800
TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860
TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920
GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980
TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040
GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100
GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160
TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220
CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280
GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340
AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400
CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460
TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520
AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580
AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640
ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700
GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760
CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820
GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880




CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 519 Protein sequence
Protein Accession #: NP_006527.1

1 11 21 31 41 51
| | | | | |
MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60
IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120
GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180
KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240
MFMQSLSSVV EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300
TFSLVQAGDK VVCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360
RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EVVEKLNGKA YGSVMILVTS 420
GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480
SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540
FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600
AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660
AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720
IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780
LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840
PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900
LILKGVLTAM GLIGIICLII VVTHHTLSRK KRADKKENGT KLL



Seq ID NO: 520 DNA sequence
Nucleic Acid Accession #: NM_000228.1
Coding sequence: 82..3600

1 11 21 31 41 51
| | | | | |
GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60
GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120
CTCCTGCATG CCCAACTU^GC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180
CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240
ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300
CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360
TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420
TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480
GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540
ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600
CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660
ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720
ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780
CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840
GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900
CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960
GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 1020
CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080
GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140
AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200
GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260
CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320
AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380
AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440
CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500
AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560
CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620
GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680
TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 1740
CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800
CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860
GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920
GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 1980
ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2040
GCCATCCTCT CCCTCAGGCG TACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 2100
GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160
ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220
GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280
GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340
CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400
ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460
ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520
TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580
CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640




AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360

CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540 

GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600

TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3660

GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G

Seq ID NO: 521 Protein sequence
Protein Accession #: NP_000219.1
MRPFFLLCFA
EWQMKCCKCD 
MEFQGPMPAG 
NARIiNGGKVQ 
VSQLRLQGSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQLDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAEIjKD 
MLRSADIjTGL 



11 

I 

LPGLIjHAQQA 
SRQFHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HECQRCDCNG 
SIQETCISCE 
CDCNILGSRR 
PTVQPVHRAV 
GRCLCRPGLT 
SGPGLEDRGL 
LEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFW 
RLGQSSMLGE 
EKRVEQIRDH 



21 

I 

CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFDPA 
CDPDGAVPGA 
DMPCDEESGR 
PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 
TRMEELRHQA 
QGARIQSVKT 
INGRVLYYAT 



31 
I 

GDLLVGRTRF 
PMRWWQSQND 
ADCTSTFPRV 
VGEITNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCIiPNWGP 
CSAAAIRQCP 
YCNRYPVCVA 
lEQIRAVLSS 
GLLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



Seq ID NO: 522 DNA sequence 
Nucleic Acid Accession #: NM_001944. 
Coding sequence: 84.. 3083 



1 
I 

TTTTCTTAGA 
TTTCACCAGG 
CCATCTTCGT 
ATGATGAAGA 
AATTTGCCAA 
TTACTTCAGA 
ATCAGCCGCC 
CTATAGTCGA 
AAGGACTAGA 
ATCCTCCAGT 
ACTCACTGGT 
AAATTGCCTT 
GAAACACTGG 
ATCGTCTGGT 
GTAATATTAA 
CAGCACGTAT 
TGGATGAAGA 
GAAATTGGTT 
AGGCTCTAGA 
CTGAATTTCA 
AGGTAATAAA 
AAAAAGGCAT 
ATGAGGACAC 
GATACCTAAT 
ATTCTACTTT 
CGGGTAAAAC 
CAACAGCTGT 
CTAGAACACT 
TAAAGTTGCC 
CCCA66AACA 
ACAATCGGTG 



11 

I 

CATTAACTGC 
GAAATCAGAG 
GGTGGTCATA 
AGAGATGACT 
ACCCTGCAGA 
TTACCAAGCA 
TTTTGGAATC 
CCGGGAGGAA 
TGTAGAGAAA 
ATTTTCACAA 
GATGATACTA 
CAAAATTGTC 
GGAAGTCCGT 
TGTGAGTGGT 
AGTGAAAGAT 
TGAAGAATU^T 
GTACACAGAT 
TGAAATACAA 
TTATGAACAA 
CCAATCAGTT 
TGTAAGAGAA 
AAGTAGCAAA 
TAACAAAGCT 
GATTGATTCA 
CATAGTTAAC 
TTCTACAGGC 
CCTCGAAAAA 
GAATAATAGA 
TGCCGTATGG 
GATACCTCCT 
TGAGATGCCA 



21 
I 

AGACGGCTGG 
ACAATGATGG 
TTGGTTCATG 
ATGCAACAAG 
GAAGGAGAAG 
ACCCAGAAAA 
TTTGTTGTTG 
ACTCCAAGCT 
CCACTTATAC 
CAAATTTTCA 
AATGCCACAG 
TCTCAGGAAC 
ACTTTGACCA 
GCAGACAAAG 
GTCAACGATA 
ATTTTAAGTT 
AATTGGCTTG 
ACTGATCCTA 
CTACAAAGCG 
ATCTCTCGAT 
GGAATTGCAT 
AAATTGGTGG 
GCCTCAJ^ATG 
AAAACTGCTG 
AAAACAATCA 
ACGGTATATG 
GATGCAGTTT 
TACACTGGCC 
AGTATCACAA 
GGAGTATACC 
CGCAGCTTGA 



31 
I 

CAGGATAGAA 
GGCTCTTCCC 
GAGAATTGCG 
CTAAAAGAAG 
ATAACTCAAA 
TCACCTACCG 
ACAAAAACAC 
TCCTGATCAC 
TAACGGTTAA 
TGGGTGAAAT 
ATGCAGATGA 
CAGCAGGCAC 
ATTCTCTTGA 
ATGGAGAAGG 
ACTTCCCAAT 
CTGAATTACT 
CAGTATATTT 
GAACTAATGA 
TG7UWVCTTAG 
ACCGAGTTCA 
TCCGTCCTGC 
ATTATATCCT 
TCAAATATGT 
AAATCAAATT 
CAGCTGAGGT 
TTAGAGTACC 
GCAGTTCTTC 
CCTATACATT 
CCCTCAATGC 
ACATCTCCCT 
CACTGGAAGT 



41 
I 

LRASSTCGLT 
VNPVSLQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCFQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MBEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 
EMMDRMKDME 



41 
I 

GCAGCGGCTC 
CAGAACTACA 
AATAGAGACT 
GCAAAAACGT 
AAGAAACCCA 
AATCTCTGGA 
TGGA6ATATT 
ATGTCGGGCT 
AATTTTGGAT 
TGAAGAAAAT 
ACCAAACCAC 
ACCCATGTTC 
CCGAGAGCAA 
ACTATCAACT 
GTTTAGAGAC 
TCGATTTCAA 
CTTTACCTCT 
AGGCATCCTG 
TATTGCTGTC 
GTCAACCCCA 
TTCCAAGACA 
GGGAACATAT 
CATGGGACX3T 
TGTCAAAAAT 
TCTGGCCATA 
CGATTTCAAT 
ACCTTCCGTG 
TGCACTGGAA 
TACCTCGGCC 
GGTACTTACA 
CTGTCAGTGT 



51 

KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGL 
KIiASGQGCEP 
CRACDCDFRG 
DLREQAIiRFG 
VASAILSLRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
LIQQVRDFLT 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 



51 
I 

ACTTGGACTT 

GGGGCTCTGG 

AAAGGTCAAT 

GAATGGGTGA 

ATTGCCAAGA 

GTGGGAATCG 

AACATAACAG 

CTAAATGCCC 

ATTAATGATA 

AGTGCCTCAA 

TTGAATTCTA 

CTCCTAAGCA 

GCTAGCAGCT 

CAATGTGAAT 

TCTCAGTATT 

GTAACAGATT 

GGGAATGAAG 

AAAGTGGTGA 

AAAAACAAAG 

GTCACAATTC 

TTTACTGTGC 

CAAGCCATCG 

AACGATGGTG 

ATGAACCGAG 

GATGAATACA 

GACAATTGTC . 

GTTGTCTCCG 

GATCAACCTG 

CTCCTCAGAG 

GACAGTCAGA 

GACT^CAGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
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180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCGTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCGATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 523 Protein sequence 
Protein Accession #: NP_001935.1 

1 11 21 31 41 51 

I I I I I I 

MMGLFPRTTG ALAIFVVVIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF VVDKNTGDIN ITAIVDREET 120 

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLVVSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK VVKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

VYVRVPDFND NCPTAVLEKD AVCSSSPSVV VSARTLNNRY TGPYTFALED QPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 524 DNA sequence 

Nucleic Acid Accession #: XM_058069.2 

Coding sequence: 1..1413 

1 11 21 31 41 51 

I I I I I I 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840 

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TGTGAGAGAC CAAAGACCAG TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 

Seq ID NO: 525 Protein sequence 
Protein Accession #: P39900 

1 11 21 31 41 51 

I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 

NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 526 DNA sequence 

Nucleic Acid Accession #: NM_024423.1 

Coding sequence: 64..2590 

PCT/US02/12476 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 

GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 

AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 

TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 

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CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTAGAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 527 Protein sequence 
Protein Accession #: NP_077741.1 

1 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300 

TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600 

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NO: 528 DNA sequence 

Nucleic Acid Accession #: NM_001941.2 

Coding sequence: 64.. 2754 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 
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AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 

GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 

TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 

TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 

ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

TAACCATGTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 

ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 

AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 

AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 

ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
TATATATAAT CCCGAAACAT G 



Seq ID NO: 529 Protein sequence 
Protein Accession #: NP_001932.1 



1 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 
KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 
PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 
TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 
KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 
KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR 



Seq ID NO: 530 DNA sequence 

Nucleic Acid Accession #: NM_016583.2 

Coding sequence: 72..842 

1 11 21 31 41 51 

I I I I I I 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 
TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 
CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 
ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 
ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 
TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 
CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 
AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 
TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 
TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 
TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 
CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 
CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 
GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 



Seq ID NO: 531 Protein sequence 
Protein Accession #: NP_057667.1 



1 11 21 31 41 51 

I I I I I I 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 

SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 

THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 
DIVNMLIHGL QFVIKV 



Seq ID NO: 532 DNA sequence 

Nucleic Acid Accession #: NM_004363.1 

Coding sequence: 115..2223 



1 11 21 31 41 51 

I I I I I I 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 
TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 
TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 
TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 
TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 
TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 
GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 
CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 
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CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA C7UVGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG 6ACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

5 TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 102 0 

10 AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 10 80 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 12 00 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

^ - GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

15 CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 150 0 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

20 CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

T6TGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 192 0 

25 TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

30 TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACA7UUV ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 252 0 

35 ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2 580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2 640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGT^AACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

40 AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 



Seq ID NO: 533 Protein sequence
Protein Accession #: NP_004354.1

1 11 21 31 41 51
| | | | | |
MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ
HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY
TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV
NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP
TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ
AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN
QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI
SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN
NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS
LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP
PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI



Seq ID NO: 534 DNA sequence

Nucleic Acid Accession #: NM_006952.1

Coding sequence: 11..793

1 11 21 31 41 51
| | | | | |
AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60
TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120
ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180
GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240
TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300
AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360
ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420
TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480
CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540
TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600
AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660
CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720
ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780
AATTGAATAT TAAGAA



Seq ID NO: 535 Protein sequence
Protein Accession #: NP_008883.1

1 11 21 31 41 51
| | | | | |
MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240
LCWTFWVLLG TMFYWSRIEY



Seq ID NO: 536 DNA sequence

Nucleic Acid Accession #: NM_002638.1

Coding sequence: 120..473

1 11 21 31 41 51
| | | | | |
CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120
TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A



Seq ID NO: 537 Protein sequence 
Protein Accession #: NP_002629.1

1 11 21 31 41 51

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ



Seq ID NO: 538 DNA sequence
Nucleic Acid Accession #: NM_001793.2
Coding sequence: 71..2560

1 11 21 31 41 51
| | | | | |
AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180
CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240
CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480
AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660
CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720
GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900
CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960
CATCCAGGCC ACAGACATGG ATGGGGAGGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080
GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560
TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620
GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680
ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740
CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800
CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980
GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040
CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100
GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160
CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220
GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280
CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340
TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460
CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580
GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640
GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700
ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760
AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940
TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060
TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120
GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180
TAAAGAAACT TTTCCCAGAA AAAAA

Seq ID NO: 539 Protein sequence
Protein Accession #: NP_001784.2

1 11 21 31 41 51
| | | | | |
MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540
VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD

Seq ID NO: 540 DNA sequence
Nucleic Acid Accession Eos sequence
Coding sequence: 1..672

1 11 21 31 41 51
| | | | | |
ATGAGGCTCC AAAGACCCCG ACAGGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60
CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 120
AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 180
CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240
GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 300
TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360
ACAGAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420
AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 480
CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540
TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600
AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660
AGCCTGTCTT GA

Seq ID NO: 541 Protein sequence
Protein Accession Eos sequence

1 11 21 31 41 51
| | | | | |
MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60
LLALLLVVAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 120
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 180
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS

Seq ID NO: 542 DNA sequence

Nucleic Acid Accession #: XM_035292.2

Coding sequence: 53..1576

1 11 21 31 41 51
| | | | | |
GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60
TGCGGGCCCG AAGCGGCGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120
GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180
CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240
TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300
GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360
CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420
CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480
ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540
CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600
GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660
CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GAAAGGGTGA 720
TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 780
TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 840
CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 900
CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960
GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTTCGGG AACTATCACC TGGGCGTCAT 1020
GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 1080
GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 1140
CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 1200
GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 1260
CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 1320
TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 1380
CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 1440
CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 1500
GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC

Seq ID NO: 543 Protein sequence
Protein Accession #: XP_035292.2

1 11 21 31 41 51
| | | | | |
MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60
GTIIGSGIFV TPTGVLKEAG SPGLALVVWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120
LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180
LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240
GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 300
STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360
SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420
RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480
KPKWLLQGIF STTVLCQKLM QVVPQET

Seq ID NO: 544 DNA sequence

Nucleic Acid Accession #: NM_005268.1

Coding sequence: 168..989

1 11 21 31 41 51
| | | | | |
TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60
TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120
AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180
TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240
TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300
GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360
TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420
CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480
ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540
GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600
TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660
ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720
TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780
TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840
TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900
CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960
GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020
CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080
CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140
TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC

Seq ID NO: 545 Protein sequence
Protein Accession #: NP_005259.1

1 11 21 31 41 51
| | | | | |
MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL VVMHVAYREV QEKRHREAHG ENSGRLYLNP 120
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP 180
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL

Seq ID NO: 546 DNA sequence

Nucleic Acid Accession #: NM_002391.1

Coding sequence: 26..457

1 11 21 31 41 51
| | | | | |
CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60
CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA 120
CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180
CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240
GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300
TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360
CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGG 420
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480
GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600
ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGRAGCT 660
TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720
ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780
TAATAT

Seq ID NO: 547 Protein sequence
Protein Accession #: NP_002382.1

1 11 21 31 41 51

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120
RVTKPCTPKT KAKAKAKKGK GKD

Seq ID NO: 548 DNA sequence

Nucleic Acid Accession #: NM_006783.1
Coding sequence: 1..786

1 11 21 31 41 51
| | | | | |
ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60
GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120
CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180
AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240
CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300
GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360
ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATG 420
TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480
TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540
TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600
ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660
AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720
CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780
AGCTAA

Seq ID NO: 549 Protein sequence
Protein Accession #: NP_006774.1

1 11 21 31 41 51
| | | | | |
MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILVVAA QEVWGDEQED FVCNTLQPGC 60
KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120
IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180
FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240
QNEMNELISD SGQNAITGFP S

Seq ID NO: 550 DNA sequence

Nucleic Acid Accession #: NM_002571.1

Coding sequence: 99..587

1 11 21 31 41 51
| | | | | |
CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60
TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120
AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 180
ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240
CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300
AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 360
TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420
AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480
AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540
GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 600
CCAGGAAGAC CAGACTCCCA CCCTTCCACA GCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660
TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720
TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G

Seq ID NO: 551 Protein sequence
Protein Accession #: NP_002562.1

1 11 21 31 41 51
| | | | | |
MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 60
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF

Seq ID NO: 552 DNA sequence

Nucleic Acid Accession #: NM_006500.1

Coding sequence: 27..1967

1 11 21 31 41 51
| | | | | |
ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60
TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180
AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240
TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300
TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360
GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540
TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840
GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900
GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACT ACCACTGGCC 1620

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAATU^ AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID NO: 553 Protein sequence 
Protein Accession #: NP_006491.1

1 11 21 31 41 51
| | | | | |
GLPRLVCAFL LAACCCCPRV AGVPGEAEQP APELVEVEVG STALLKCGLS QSQGNLSHVD 60
WFSVHKEKRT LIFRVRQGQG QSEPGEYEQR LSLQDRGATL ALTQVTPQDE RIFLCQGKRP 120
RSQEYRIQLR VYKAPEEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL 180
KEEKNRVHIQ SSQTVESSGL YTLQSILKAQ LVKEDKDAQF YCELNYRLPS GNHMKESREV 240
TVPVFYPTEK VWLEVEPVGM LKEGDRVEIR CLADGNPPPH FSISKQNPST REAEEETTND 300
NGVLVLEPAR KEHSGRYECQ AWNLDTMISL LSEPQELLVN YVSDVRVSPA APERQEGSSL 360
TLTCEAESSQ DLEFQWLREE TDQVLERGPV LQLHDLKREA GGGYRCVASV PSIPGLNRTQ 420
LVKLAIFGPP WMAFKERKVW VKENMVLNLS CEASGHPRPT ISWNVNGTAS EQDQDPQRVL 480
STLNVLVTPE LLETGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG LSTSTASPHT 540
RANSTSTERK LPEPESRGVV IVAVIVCILV LAVLGAVLYF LYKKGKLPCR RSGKQEITLP 600
PSRKTELVVE VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRH

Seq ID NO: 554 DNA sequence 

Nucleic Acid Accession #: NM_003183.3 

Coding sequence: 165..2639

1 11 21 31 41 51

| | | | | |

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 60 

GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG 120 

CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC 180 

TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCCGGAT GACCCGGGCT 240 

TCGGCCCCCA CCAGAGACTC GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 300

CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATG 360

TAGAAACACT ACTAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 420

GTACTGAACG TTTTTCACAA AATTTCAAGG TCGTGGTGGT GGATGGTAAA AACGAAAGCG 480

AGTACACTGC AAAATGGCAG GACTTCTTCA CTGGACACGT GGTTGGTGAG CCTGACTCTA 540 

GGGTTCTAGC CCACATAAGA GATGATGATG TTATAATCAG AATCAACACA GATGGGGCCG 600 

AATATAACAT AGAGCCACTT TGGAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 660 

TTTATAAATC TGAAGATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 720 

ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGGGTT AGTAGACAGA GAACCACCTG 780

AAGAGCTTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACACGTGTA 840 

AATTATTGGT GGTAGCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGGG GAAGAGAGTA 900 

CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 960 

CATGGGATAA TGCAGGTTTT AAAGGCTATG GAATACAGAT AGAGCAGATT CGCATTCTCA 1020 

AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AGCACTACAA CATGGCAAAA AGTTACCCAA 1080

ATGAAGAAAA GGATGCTTGG GATGTGAAGA TGTTGCTAGA GCAATTTAGC TTTGATATAG 1140 

CTGAGGAAGC ATCTAAAGTT TGCTTGGCAC ACCTTTTCAC ATACCAAGAT TTTGATATGG 1200 

GAACTCTTGG ATTAGCTTAT GTTGGCTCTC CCAGAGCAAA CAGCCATGGA GGTGTTTGTC 1260

CAAAGGCTTA TTATAGCCCA GTTGGGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 1320

GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGGAAGC TGACCTGGTT ACAACTCATG 1380 




AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG TCTAGCAGAA 
ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGTGAGTGGC 
ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT 
CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG GAACTCGAGG 
GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA CACCTGCTGC 
GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT 
AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA TGCTACTTGC 
CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA 
TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC 
AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG 
ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA AAAGAACTTA 
AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAG 
AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT 
AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT CTCCTTGATA 
CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA TAAACAGTAT 
CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTGCA 
TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT GCAGCCTGCC 
CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT GGACACCATC 
CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA GGACCCCTTC 
GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC GGTCGCCAGA 
CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG CTW^GAAACA 
TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT TTATAGATTT 
TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC 
TTTGAA.CTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTCCT 
TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT 
TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG GTTATGAATA 
TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG GAAATGATCA 
ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 
ATAGTTTTTT tTTTTTTTTT TTTTTTTTGC CTTCAACTAA A/^CAAAGGA 
GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 
CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 
TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 
TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 
TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 
CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 555 Protein sequence 
Protein Accession #: NP 003174.2 
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MRQSLLFLTS 
QTSTHVETLL 
GEFDSRVLAH 
PKVCGYLKVD 
RGEESTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNEDQG 
SRVDEGEECD 
TCKGVSYCTG 
KVCCRDhSGR 
INTFGKFLAD 
SASVRIIKPF 
PFPNSSTAAK 



11 

1 

WPPVLAPRP 
TFSALKRHFK 
IRDDDVIIRI 
NEELLPKGLV 
IiIELIDRVDD 
AWDVKMLLEQ 
SPVGKKNIYL 
GKYVMYPIAV 
PGIMYLNNDT 
NSSECPPPGN 
CVPYVDAEQK 
NIVGSVLVFS 
PAPQTPGRLQ 
SFEDLTDHPV 



21 

I 

PDDPGFGPHQ 
LYLTSSTERF 
NTDGAEYNIE 
DREPPEELVH 
lYRNTSWDNA 
FSFDIAEEAS 
NSGLTSTKNY 
SGDHENNKMF 
CCNSDCTLKE 
AENDTVCLDL 
NLFLRKGKPC 
LIFWIPFSIL 
PAPVIPSAPA 
ARSEKAASFK 



31 

t 

RLBKIiDSLLS 
SQNFKWWD 
PLWRFVNDTK 
RVKRRADPDP 
GFKGYGIQIE 
KVCLAHLFTY 
GKTILTKEAD 
SNCSKQSIYK 
GVQCSDRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCVDKKLDK 
APKLDHQRMD 
LQRQNRVNSK 



41 
I 

DYDILSLSNI 
GKNBSEYTAK 
DKRMLVYKSE 
MKNTCKLLW 
QIRILKSPQE 
QDFDMGTLGL 
LVTTHELGHN 
TIESKAQECF 
CCKNCQFETA 
FCEREQQLES 
CEKRVQDVIE 
QYESLSLFHP 
TIQEDPSTDS 
ETEC 



TGTGCCCCGA 
GATCACGAGA 
GAAAGTAAGG 
GTGGATGAAG 
AACAGCGACT 
AAAAACTGTC 
AAAGGCGTGT 
AATGACACTG 
GAGAGGGAAC 
TGCTGCAGGG 
TTTTTGAGGA 
AAACGAGTAC 
ACTTTTGGAA 
TTTTGGATTC 
GAATCTCTGT 
TCGGTTCGCA 
CCTGTGATCC 
CAGGAAGACC 
CCAAATAGCA 
AGTGAAAAGG 
GAGTGCTAAT 
GACCTACAAA 
TGGTCATGTG 
TTTGAAAAGG 
AAAATATCTT 
TTTACGTTTT 



GAAAAGTGGA 
GATAAATTTA 
TTTTATGTAG 
TTCATAATTC 
TGGTAGCCAG 
TTTTTCTGTA 
AGAAAATTCA 
GCTGAGGTTG 



51 

I 

QQHSVRKRDL 
WQDFFTGHW 
DIKNVSRLQS 
ADHRFYRYMG 
VKPGEKHYNM 
AYVGSPRANS 
FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAINA 
CACNETDNSC 
RFWDFIDQLS 
SNVEMLSSMD 
HMDEDGFEKD 



Seq ID NO: 556 DNA sequence 

Nucleic Acid Accession #: NM_021832.1 

Coding sequence: 164.. 2248 



11 



21 



31 



41 



51 



TCGAGCCTGG 
CTAGGCCGGG 
GAAGGCTGCC 
ATTCCTGACC 
CGGCCCCCAC 
TTTATCTAAT 
AGAAACACTA 
TACTGAACGT 
GTACACTGTA 
GGTTCTAGCC 
ATATAACATA 
TTATAAATCT 
TTTAAAAGTG 
AGAGCTTGTT 
ATTATTGGTG 
AACTACMAT 
ATGGGATAAT 
GTCTCCACAA 
TGAAGAA?W\G 
TGAGGAAGCA 
AACTCTTGGA 
AAAGGCTTAT 
CACAAAGAAT 
ATTGGGACAT 



CGGTAGAATC 
CGGATCCCGT 
CAGAGAGGTG 
AGCGTGGTTC 
CAGAGACTCG 
ATCCAGCAGC 
CTAACTTTTT 
TTTTCACAAA 
AAATGGCAGG 
CACATAAGAG 
GAGCCACTTT 
GAAGATATCA 
GATAATGAAG 
CATCGAGTGA 
GTAGCAGATC 
TACTTAATAG 
GCAGGTTTTA 
GAGGTAAAAC 
GATGCTTGGG 
TCTAAAGTTT 
TTAGCTTATG 
TATAGCCCAG 
TATGGTAAAA 
AATTTTGGAG 



TTCCCAGTAG 
CCTCCCCCGA 
GAGTCGGTAG 
CTTTCX3TGCT 
AGAAGCTTGA 
ATTCGGTAAG 
CAGCTTTGAA 
ATTTCAAGGT 
ACTTCTTCAC 
ATGATGATGT 
GGAGATTTGT 
AGAATGTTTC 
AGTTGCTCCC 
AAAGAAGAGC 
ATCGCTTCTA 
AGCTAATTGA 
AAGGCTATGG 
CTGGTGAAAA 
ATGTGAAGAT 
GCTTGGCACA 
TTGGCTCTCC 
TTGGGAAGAA 
CCATCCTTAC 
CAGAACATGA 



GCGGCGCGGG 
TGTGAGCAGT 
CGGGGCCGGG 
GGCGCCGCGA 
TTCTTTGCTC 
TIAAAAGAGAT 
AAGGCATTTT 
CGTGGTGGTG 
TGGACACGTG 
TATAATCAGA 
TAATGATACC 
ACGTTTGCAG 
AAAAGGGTTA 
TGACCCAGAT 
CAGATACATG 
CAGAGTTGAT 
AATACAGATA 
GCACTACAAC 
GTTGCTAGAG 
CCTTTTCACA 
CAGAGCAAAC 
AAATATCTAT 
AAAGGAAGCT 
TCCX3GATGGT 



AGGAAAAGAG 
TTTCCGAAAC 
AACATGAGGC 
CCTCCGGATG 
TCAGACTACG 
CTACAGACTT 
AAATTATACC 
GATGGTAAAA 
GTTGGTGAGC 
ATCAACACAG 
AAAGACAAAA 
TCTCCAAAAG 
GTAGACAGAG 
CCCATGAAGA 
GGCAGAGGGG 
GACATCTATC 
GAGCAGATTC 
ATGGCAAAAA 
CAATTTAGCT 
TACCAAGATT 
AGCCATGGAG 
TTGAATAGTG 
GACCTGGTTA 
CTAGCAGAAT 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



60 
120 

180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



GATTGAGGGG 
CCCGTCAGGC 
AGTCTCTCCT 
ACCCGGGCTT 
ATATTCTCTC 
CAACACATGT 
TGACATCAAG 
ACGAAAGCGA 
CTGACTCTAG 
ATGGGGCCGA 
GAATGTTAGT 
TGTGTGGTTA 
AACCACCTGA 
ACACGTGTAA 
AAGAGAGTAC 
GGAACACTTC 
GCATTCTCAA 
GTTACCCAAA 
TTGATATAGC 
TTGATATGGG 
GTGTTTGTCC 
GTTT6ACGAG 
CAACTCATGA 
GTGCCCCGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620

AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGGCG GCCTCCAGGA AATGCTGAAG ATGACACTGT 1860 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 

GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980

CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220

TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 

GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820

AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880

TTACGTTTTT GTAAATTAAT CTTTTATATT GATT^ACAGGC ACTGACTAGG GAAATGATCA 2940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060 

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120

TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 

TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 

TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 3360

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 557 Protein sequence
Protein Accession #: NP_068604.1

1 11 21 31 41 51

| | | | | |

MRQSLLFLTS VVPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKVVVVD GKNESEYTVK WQDFFTGHVV 120

GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLVV ADHRFYRYMG 240

RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 300

AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 360

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 420

AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 480

SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540

TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC 600

KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660
INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCV

Seq ID NO: 558 DNA sequence 

Nucleic Acid Accession #: NM_004994.1

Coding sequence: 20.. 2143

1 11 21 31 41 51

| | | | | |

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660

GGGCGTCGTG GTTCCAACTG GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 
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CCTTATCGCC GhCAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seq ID NO: 559 Protein sequence 
Protein Accession #: NP_004985.1 

1 11 21 31 41 51 

| | | | | |

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60

RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120

ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180

FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVVVPT RFGNADGAAC HFPFIFEGRS 240

YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300

ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 360

CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420

PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480

PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540

RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600

LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED

Seq ID NO: 560 DNA sequence

Nucleic Acid Accession #: NM_000213.1

Coding sequence: 127.. 5385

1 11 21 31 41 51

| | | | | |

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTGT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC GGACAGAGGT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880

GGCTACTACA CCCTCACTGG AGACGAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 




GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660

CCCTCAGTGG AGCTCAGCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200 

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380

ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640
TACTG 

Seq ID NO: 561 Protein sequence 
Protein Accession #: NP_000204.1 

1 11 21 31 41 51 

| | | | | |

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60

CNTQAELLAA GCQRESIVVM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120

EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240

QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300

TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420

THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NGDFVCGQCV 480

CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540

FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 600

CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660

LKRAEEVVVR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLL 720

LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780

SGNLKGRDVV RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840

AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 900

LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960

LVEAIDVPAG TATLGRRLVN ITIIKEQARD VVSFEQPEFS VSRGDQVARI PVIRRVLDGG 1020

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 1080

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 1200

AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 1260

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 1380

FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 1500

ELHRLNIPNP AQTSVVVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 1560

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740
LSTHMDQQFF QT

Seq ID NO: 562 DNA sequence 

Nucleic Acid Accession #: NM_013332.1 

Coding sequence: 1.. 63

1 11 21 31 41 51

| | | | | |
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GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 

AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 120

CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180

GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240

GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300 

GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360 

CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 

ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACT^GC 480 

TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 540

CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 600 

CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660 

CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720 

GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 780 

GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840 

CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900

TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960 

TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020

TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080 

TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 1140 

CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200 

TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260 

GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 563 Protein sequence 
Protein Accession #: NP_037464.1

1 11 21 31 41 51

| | | | | |

MKHVLNLYLL GVVLTLLSIF VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS 60
RSM



Seq ID NO: 564 DNA sequence 

Nucleic Acid Accession #: NM_023915.1

Coding sequence: 250.. 1326

1 11 21 31 41 51

| | | | | |

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60

TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120

GTGAATGGAC AGCCAGCCAC CACAATGAAA GA/yVTCAAAC CAGGAATAAC CTATGCTGAA 180

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTG 720

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200

TGTAGGTCAT TTTCj=AGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380
TTCATTATCC TTAAAAAAAA AA

Seq ID NO: 565 Protein sequence
Protein Accession #: NP_076404

1 11 21 31 41 51

| | | | | |

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60

GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120

FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240

ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV

Seq ID NO: 566 DNA sequence

Nucleic Acid Accession #: NM_005365.1

Coding sequence: 1.. 948

1 11 21 31 41 51

| | | | | |

ATGTCTCTCG AGCAGAGGAG TCCGCACTGC AAGCCTGATG AAGACCTTGA AGCCCAAGGA 60

GAGGACTTGG GCCTGATGGG TGCACAGGAA CCCACAGGCG AGGAGGAGGA GACTACCTCC 120

TCCTCTGACA GCAAGGAGGA GGAGGTGTCT GCTGCTGGGT CATCAAGTCC TCCCCAGAGT 180

CCTCAGGGAG GCGCTTCCTC CTCCATTTCC GTCTACTACA CTTTATGGAG CCAATTCGAT 240

GAGGGCTCCA GCAGTCAAGA AGAGGAAGAG CCAAGCTCCT CGGTCGAGCC AGCTCAGCTG 300

GAGTTCATGT TCCAAGAAGC ACTGAAATTG AAGGTGGCTG AGTTGGTTCA TTTCCTGCTC 360

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CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA TGCTGGAGAG CGTCATCAAA 420 

AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTGATC 480 

TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 540 

CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA GCATGCCCAA GGCCGCCCTC 600 

CTGATCATTG TCCTGGGTGT GATCCTAACC AAAGACAACT GCGCCCCTGA AGAGGTTATC 660

TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG AGCACATGTT CTACGGGGAG 720 

CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT ACCTGGAGTA CCGGCAGGTG 780 

CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT CCAAGGCCCA CGCTGAAACC 840 

AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAGAGCC CATCTGCTAC 900
CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG GAGTCTGA

Seq ID NO: 567 Protein sequence
Protein Accession #: NP_005356.1

1 11 21 31 41 51

| | | | | |

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 

PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120 

HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180 

LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240

PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300 
PSLYEEVLGE EQEGV 

Seq ID NO: 568 DNA sequence
Nucleic Acid Accession #: NM_014400
Coding sequence: 86.. 1126

1 11 21 31 41 51

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600 

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840 

CACATCTGTC ACCACTTCTA CCTCGGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACGACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620 

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATA7\AGA TTTAATTACT 1680 

TTGTATAGTG AAAAAAAA 

Seq ID NO: 569 Protein sequence
Protein Accession #: NP_055215 

1 11 21 31 41 51 

| | | | | |

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120

SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPVVS CYNASDHVYK GCFDGNVTLT 180

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL

Seq ID NO: 570 DNA sequence 

Nucleic Acid Accession #: NM_005329.1 

Coding sequence: 1. .1662 



1 11 21 31 41 51 

| | | | | |

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60 

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180

CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240 

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCGT ACCAGGAGGA CCCTGACTAC 300

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420 

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540 

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 571 Protein sequence 
Protein Accession #: NP_005320.1

1 11 21 31 41 51

| | | | | |

MPVQLTTALR VVGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60

LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKVVM 120

VVDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDVVRA 180

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240

VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360

YFREWLYNSL WFHKHHLWMT YESVVTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420

IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIVVNFIG 480

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540
KKPEQYSLAF AEV

Seq ID NO: 572 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 148-7095

1 11 21 31 41 51

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACTACA 3240

TCTGTGTTTG GTGATGATAA TAAGGCGGTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGGCCAATTT ATATCATTAA CAATGTGTGC 7260

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA

Seq ID NO: 573 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR 1680

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920

AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 

HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 

SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV

Seq ID NO: 574 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 148-4518

1 11 21 31 41 51

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340
AAAAAAAAAA AAAAAAAAAA AAAAAAA



Seq ID NO: 575 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780

PLVIVSALTF ICLVVLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020

GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR 1080

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV 1260

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440
AALPDGNIAE SLESLV

Seq ID NO: 576 DNA sequence

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 

| | | | | |

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340
AAA 

Seq ID NO: 577 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

| | | | | |

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020

GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVVVHCSAGV GRTGTYIVLD 1080

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320

CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440
AESLESLV



Seq ID NO: 578 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51 

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540

CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600

TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720

ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780

GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840

TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900

ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960

TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020

TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080

TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320

ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340
AAAAAAAAAA AAAAAAAAAA AAA



Seq ID NO: 579 Protein sequence: 
Protein Accession #: EOS sequence

1 11 21 31 41 51

I I I I I I

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60

LSILFEVGTE ENLDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120

PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 180

SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRVV YDTMIEKFAV LYQQLDGEDQ 240

TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLIVDM PTDNPELDLF 300

PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360

TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 420

NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 480

ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 540

DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600

FPTEVTPHAF TPSSRQQDLV STVNVVYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660

IPLVIVSALT FICLVVLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720

IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 780

IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840

NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900

KGSQKGRPSG RVVTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VVVHCSAGVG 960

RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET 1020

EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080

PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 1140

VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 1200

YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260

LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 1320
GAALPDGNIA ESLESLV



Seq ID NO: 580 DNA sequence 

Nucleic Acid Accession #: EOS sequence

Coding sequence: 148-4632 

1 11 21 31 41 51 
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TGTAGTTCTG TGTAATTGTT 5220

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 581 Protein sequence: 
Protein Accession #: EOS sequence

1 11 21 31 41 51 

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT 1080

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP 1320

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 



Seq ID NO: 582 DNA sequence

Nucleic Acid Accession #: NM_002851.1
Coding sequence: 148..7092

1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC TAGTAGTGAT 2520

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 

GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 

GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 

ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 

GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 

CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 

CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800

CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 

ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 

TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 

GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 

TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100

GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 

TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 

TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 5280 

GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 

AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 

AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 

GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520

AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580

AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640

GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 

CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760

AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820

CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880

ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940

TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 

ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060 

AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 

ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 

CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 6600 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 6720

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 583 Protein sequence:
Protein Accession #: NP_002842.1



PCT/US02/12476 



1 11 21 31 41 51

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440

CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLNE 1560

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LVVLVGILIY WRKCFQTAHF YLEDSTSPRV 1680

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800

GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV 1860

LAYYTVRNFT LRNTKIKKGS QKGRPSGRVV TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920

KRHAVGPVVV HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980

YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100

TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL MAEEHKCLSN 2160

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIKEE AANRDGPMIV 2220

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV



Seq ID NO: 584 DNA sequence 

Nucleic Acid Accession #: NM_005688.1

Coding sequence: 126..4439



1 11 21 31 41 51

I I I I I I

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA

GGAGAACrCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGT^GAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAA/UVGG 2520

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGQGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120

TGGTGTTCTT CTGTGTGGGA ATGATCX3CAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTGTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA T6ACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGOGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT QGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTA7\AGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATQGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCG6ATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCG CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 585 Protein sequence
Protein Accession #: NP_005679.1

1 11 21 31 41 51 

I I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKPRR TRPLBCQDAL BTAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSAIiKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLIj LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFIIiFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVIjNS 660 

CCIiRPDLAIL PSSDLTEIGB RGANLSGGQR QRISLARALY SDRSIYILDD PLSALD/fflVG 720 

NHIFNSAIRK HIiKSKTVLFV THQLQYLVDC DBVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNIiLLGETP PVBIWSKKET SGSQKKSQDK GPKTGSVKKE KAVKPBEGQL VQLEEKGQGS 840 
VPWSVYGVYI QAAGGPLAFIi VIMALFMLNV GSTAFSTWWIi SYWIKQGSGN TTVTRGNETS 900
VSDSMKDNPH MQYYASIYAL SMAVMIiILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRI»PFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRBLK RLDNITQSPF LSHITSSIQG LATIHAYWKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAIj FRIiVELSGGC IKIDGVRISD 1260 

IGL7U3LRSKI1 SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALIiRHCKI IiIIiDEAT/y^M DTBTDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG






Seq ID NO: 586 DNA sequence 

Nucleic Acid Accession #: NM_001327.1 

Coding sequence: 89..631



1 11 21 31 41 51 

I I I I I I

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCX3GGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCX3GGGTCCX3 CATGGCGGCG CGGCTTCAGG 300

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCX3CCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 

GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA

Seq ID NO: 587 Protein sequence 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51

I I I I I I

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 

PRGPHGGAAS GIiNGCCRCGA RGPESRLLBF YIjAMPFATPM EAELARRSLA QDAPPtiPVPG 120 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVP LAQPPSGQRR 



Seq ID NO: 588 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52..459



1 11 21 31 41 51

I I I I I I 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120

GCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240

CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCC 300 

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO: 589 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 

PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120
FLPVFLAQAP SGQRR 

Seq ID NO: 590 DNA sequence 
Nucleic Acid Accession #: NM_005562.1
Coding sequence: 90..3671

1 11 21 31 41 51 

I I I I I I 

ACAGCQGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG 6GCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCT^GGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 
TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCT6C CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCC/^AG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGQAATT 1680 

GT6ACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCG/UVGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACX2GTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880

AAAGCAGAGC ACAAGi^AGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGA7VAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTA7VAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCX3AA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 591 Protein sequence 
Protein Accession #: NP_005553.1



1 11 21 31 41 51

I I I I I I

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR KPSAHDVILE GAGLRITAPL 300

MPLGKTLPCG LTKTYTPRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360

DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420

GGACDPDTGD CYSGDEMPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEVV 480
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQQI NNVDPSASQN CDRLTGRCIiK 540

CIHNTAOIYC DQCKAGYFQD PIiAFNPADKC RACaiCaiCMGS EPVOCRSDQT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EAIiISKAQGG DGWPDTBLB GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDIiKM TVERVRALGS QVQNRVRDTH 720 

RIiITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSljAQE ATRLAESHVE SASNMEQLTR 780 

ETEDVSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQI. TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS IiSTIiVTRHMD EFKRTQKNL6 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEAIiSM GNATPYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRIiSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IBQEiaSLML EANVTADGAIi AMBKOIASLK SEMREVEGEL BRKELEFDTN ^^DAVQMVITE 1080 

AQKVDTRAKH AGVTIQDTLN TLDGLlMIIiMD QPIiSVDBEGL VIjIiEQKIiSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLI.E TSIDGIIiADV KHLEHIRDHL PPGCYNTQAIi EQQ 

Seq ID NO: 592 DNA sequence

Nucleic Acid Accession #: AF101051.1 

Coding sequence: 221..856
1 11 21 31 41 51

I I I I I I

GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60

CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120

GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180

ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 240

GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 300

GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360

CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420

TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 480

CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 540

CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600

TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660

ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720

TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840

GAAAGACTAG GTGTGACACA GAGGCAAT^G GAGAAAATCA TGTTGAAACA AACCGAAAAT 900

GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960

GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGG7\AG ATTTTACCAT 1080

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GG6AA66GGT GCTCCTTAAA 1140

TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200

CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260

CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320

CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC 1380

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500

TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560

AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620

GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680

TTTGCTTTGA AAATATTTGT CCi\ATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740

CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800

ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860

TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920

GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040

AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGCCT ATGTAGTGTC 2100

TTTATTTGCT CAGCTGGCTG AGACACTGAA GTiAGTCACTG AACAAAACCT ACACACGTAC 2160

CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220

ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280

AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340

ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400

TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCA6T GAGATAGAAT ACATGTAAGT 2640

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CQTGTTGGTA AATCC7VACAG CAAGGGAGAT 2940

TTTTGAATCA T/y^TAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420

AAAAGGAAAA AAAAAAAAAA AAA



Seq ID NO: 593 Protein sequence
Protein Accession #: AAD16433.1 

1 11 21 31 41 51 

I I I I I I 

MANAGLQLLG FILAFLGWIG AIVSTAIiPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 
QIQCKVFDSL LNLSSTLQAT RALMWGILL GVIAIFVATV GMKCMKCIiED DEVQKMRMAV 
IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 
LLCCSCPRKT TSYPTPRPYP KPAESSGKDY V 
Seq ID NO: 594 DNA sequence

Nucleic Acid Accession #: NM_006180.1

Coding sequence: 352..2820

1 11 21 31 41 51 

I I I I I I

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGT6 420 

GGCTTCTGGA GGGCCX3CTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTGTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGT^ACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCX:7y\TG GQAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCT6ATGCCG TGCTGATGGC TGAGGGCAAC 2280

CCX3CCCACGG AACTGACGCA GTOGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGC6ATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGA7VATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT C7VAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCT^ GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGA6AAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAA7UVGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO: 595 Protein sequence 
Protein Accession #: NP_006171.1

1 11 21 31 41 51 

I I I I I I

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAPPRLEP 60 

NSVDPENITE IFIANQKRIjE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSIj SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FliESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

IjIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV lENPQYFGIT NSQLiKPDTFV QHIKRHNIVL 540 

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600 

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HPVHRDIiATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 
PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: 596 DNA sequence 
Nucleic Acid Accession #: AF410899 
Coding sequence: 483..2999

1 11 21 31 41 51 

I I I I I I

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180

CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 

CTGCCGGJ\AC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATTCTGGATT A7\AATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900

TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2280

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

TTCTTTTTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TG^CTTCCGT GAGACACAAG 3900

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 597 Protein sequence 
Protein Accession #: AAL67965.1

1 11 21 31 41 51 

I I 1 i I I 

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 



NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKDFSWF GFGKVKSRQG 480

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540

KPDTFVQHIK RHNIVLKREL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600

FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD LNKFLRAHGP DAVLMAEGNP 660

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720

DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGVVLW EIFTYGKQPW YQLSNNEVIE 780

CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG

Seq ID NO: 598 DNA sequence
Nucleic Acid Accession #: AB052906
Coding sequence: 74..814



PCT/US02/12476 



1 11 21 31 41 51

| | | | | |

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60
CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120
GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180
CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240
GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300
CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360
GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420
GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480
TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540
AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600
GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660
CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720
CTCAGGCACA ACCC/^CTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780
CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840
AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900
CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960
TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020
TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080
TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140
GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200
TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260
ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAT^ AA

Seq ID NO: 599 Protein sequence
Protein Accession #: BAB61048.1

1 11 21 31 41 51

| | | | | |

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 60
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT 120
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKVVAM 180
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240
FILPGI

Seq ID NO: 600 DNA sequence
Nucleic Acid Accession #: NM_001898.
Coding sequence: 57..482

1 11 21 31 41 51

| | | | | |

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120
GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600
GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720
AAACAGTAGC ATCGCC

Seq ID NO: 601 Protein sequence
Protein Accession #: NP_001889.1

1 11 21 31 41 51

| | | | | |

MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120
EIYEVPWENR RSLVKSRCQE S

Seq ID NO: 602 DNA sequence

Nucleic Acid Accession #: NM_003976.2

Coding sequence: 299..961

1 11 21 31 41 51

| | | | | |

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CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420 

GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 

CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540

GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 

AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 

CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320 

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380 
CCTGTACTCA CTCATGGGAG CTGGCCCC 

Seq ID NO: 603 Protein sequence 
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

I I I 1 I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 604 DNA sequence
Nucleic Acid Accession #: NM_057091.1
Coding sequence: 783..1445

1 11 21 31 41 51

| | | | | |

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60
GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120
TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180
CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240
CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300
CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360
CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420
CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480
CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540
CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600
TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660
GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720
TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780
AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840
AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900
CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960
CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020
GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080
CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140
CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200
GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260
CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320
CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380
ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440
GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500
GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560
TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620
CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680
CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740
GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800
AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC

Seq ID NO: 605 Protein sequence
Protein Accession #: NP_003967.1

1 11 21 31 41 51

| | | | | |

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG



WO 02/086443

Seq ID NO: 606 DNA sequence

Nucleic Acid Accession #: NM_057160.1

Coding sequence: 1..714

1 11 21 31 41 51

| | | | | |



ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60
CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120
TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180
GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240
CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300
CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360
CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420
CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480
CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540
AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600
AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660
ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720
GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780
CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840
TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900
AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960
TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020
CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080
TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140
TCACTCATGG GAGCTGGCCC C

Seq ID NO: 607 Protein sequence
Protein Accession #: NP_476501.1

1 11 21 31 41 51

| | | | | |

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG

Seq ID NO: 608 DNA sequence

Nucleic Acid Accession #: NM_057090.1

Coding sequence: 29..715

1 11 21 31 41 51

| | | | | |

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60
GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120
GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180
CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240
CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300
GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360
CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420
CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGG CTGGGCCACC GCTCCGACGA 480
GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540
CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600
CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660
CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720
CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780
GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840
CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900
CAGAGGCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960
CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020
TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080
TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140
CTCACTCATG GGAGCTGGCC CC

Seq ID NO: 609 Protein sequence
Protein Accession #: NP_476431.1

1 11 21 31 41 51

| | | | | |

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 180
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG

Seq ID NO: 610 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1746

1 11 21 31 41 51

| | | | | |

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60
GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120
GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180
CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240
GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300
GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360
TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480
CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540
GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080
TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380
GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440
GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680
TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740
TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800
TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980
GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CGCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100
AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280
TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340
CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400
AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460
CCCCAGCACA GCAAGCTGAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580
AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640
CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700
GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760
CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820
GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880
TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000
AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120
TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180
ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240
GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300
TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480
CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540
CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780
CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020
ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080
CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320
GGTATTCGTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380
GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500
GCCTCCTTTT GTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680
CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740
GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800
TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980
TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040
TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100
GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160
TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220
AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280
GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340
AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400
GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580
CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640
GCCCCAGTGC TTGGGGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760
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GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT 






Seq ID NO: 611 Protein sequence 
Protein Accession #: BAB84587.1

1 11 21 31 41 51 

I I I 1 I I 

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60

LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120

FQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360

FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420

YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480

VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C

Seq ID NO: 612 DNA sequence 
Nucleic Acid Accession #: XM_098151
Coding sequence: 1..447 

1 11 21 31 41 51 

I I I I 1 1 

ATGATGCATT TGCTCAATTC TCAGGGCTGG AATGAGCCGG CTGGTCCCCC AGAAAGCTGG 60

AGTGGGGTAC AGAGTTCAGT TTTCCTCTCT GTTTACAGCT CCTTGACAGT CCCACGCCCA 120 

TCTGGAGTGG GAGCTGGGAG TCAGTGTTGG AGAAGAAACA ACAAAAGCCA ATTAGAACCA 180 

CTATTTTTAA AAAGTGCTTA CTGTGCACAG ATACTCTTCA AGCACTGGAC GTGGATTCTC 240 

TCTCTAGCCC TCAGCACCCC TGCGGTAGGA GTGCCGCCTC TACCCACTTG TGATGGGGTA 300

CAGAGGCACT TGCTCTTCTG CATGGTGTTC AATAGGCTGG GAGTTTTATT TATCTCTTCA 360

AACTTTGTAC AAGAGCTCAT GGCTTGTCTT GGGCTTTCGT CATTAAACCA AAGGAAATGG 420
AAGCCATTCC CCTGTTGCTC TCCTTAG 

Seq ID NO: 613 Protein sequence 
Protein Accession #: XP_098151

1 11 21 31 41 51 

I I I I I I 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 60

LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 120
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 614 DNA sequence 
Nucleic Acid Accession #: NM_002658.1

Coding sequence: 77..1372

1 11 21 31 41 51 

I I I I I I 

GTCCCCGCAG CGGCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGTU^ CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 
ATCAATAAAA TGTGATTTTT CTGA 
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Seq ID NO: 615 Protein sequence 
Protein Accession #: NP_002649.1 



1 11 21 31 41 51 

I 1 1 1 1 I 

MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L



Seq ID NO: 616 DNA sequence 

Nucleic Acid Accession #: NM_024422.1

Coding sequence: 202..2907

1 11 21 31 41 51 

I I I I I I 

CGCCAAAGGA TUW^GCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580

TCGGAATCCT GCCGGGGGGC TGGGCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700

CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820 

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940

TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT 



Seq ID NO: 617 Protein sequence 
Protein Accession #: NP_077740.1 



1 11 21 31 41 51 

| | | | | |

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60
ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120
KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180

EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY 240 

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV 420

KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS 660

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL 840 

CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTLAEACMK 900 



Seq ID NO: 618 DNA sequence 

Nucleic Acid Accession #: NM_004949.1 

Coding sequence: 202.. 2745 

1 11 21 31 41 51

| | | | | |

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACAGCCTGG ACTCCTGCAG GGGAGGACAC 2640

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAG AATCCATTAG AGGACACACT CTGATTAAAA ATTAAACAAT GAAAGAAAGT 2760 

GTATCTGTGT AATCAAGATG AAAATCACAA GCATGCCCAA GACTATGTCC TGACATATAA 2820 

CTATGAAGGA AGAGGATCGG TGGCTGGGTC TGTAGGTTGT TGCAGTGAAC GACAAGAAGA 2880 

AGATGGGCTT GAATTTTTGG ATAATTTGGA GCCCAAATTT AGGACACTAG CAGAAGCATG 2940

CATGAAGAGA TGAGTGTGTT CTAATAAGTC TCTGAAAGCC AGTGGCTTTA TGACTTTTAA 3000 

AAAAAATTAC AAACCAAGAA TTTTTTAAAG CAGAAGATGC TATTTGTGGG GGTTTTTCTC 3060 

TCATTATTTG GATGGAATCT CTTTGGTCAA ATGCACATTT ACAGAGAGAC ACTATAAACA 3120 

AGTACACAAA TTTTTCAATT TTTACATATT TTTAAATTAC TTATCTTCTA TCCAAGGAGG 3180 

TCTACAGAGA AATTAAAGTC TGCCTTATTT GTTACATTTG GGTATAATGA CAACAGCCAA 3240

TTTATAGTGC AATAAAATGT AATTAATTCA AGTCCTTATT ATAGACTATT TGAAGCACAA 3300 

CCTAATGGAA AATTGTAGAG ACCTTGCTTT AACATTATCT CCAGTTAATT AAGTGTTCAT 3360 

GTGGTGCTTG GAAACTGTTG TTTTCCTGAA CATCTAAAGT GTGTAGACTG CATTCTTGCT 3420 

ATTATTTTAT TCTTGTAATG TGACCTTTTC ACTGTGCAAA GGGAGATTTC TAGCCAGGCA 3480

TTGACTATTA CAATTTCATT

Seq ID NO: 619 Protein sequence 
Protein Accession #: NP_004940.1

1 11 21 31 41 51

| | | | | |

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60
ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120

KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180

EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG YTPELPLPLI IKIEDENDNY 240

PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTFTRTSY 360

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCVV 420

KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYVVPI TVRDRLGMSS 660

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEESIR 840
GHTLIKN

Seq ID NO: 620 DNA sequence

Nucleic Acid Accession #: NM_032545.1

Coding sequence: 46..718

1 11 21 31 41 51

| | | | | |

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240

CGAGGGCTGG GGGCCX3GAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420

CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480 

CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG RAGGCCGGGA CTTGGGCATC GCCTTTAATT 720 

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780 

TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840

AAAAAA 

Seq ID NO: 621 Protein sequence 
Protein Accession #: NP_115934.1

1 11 21 31 41 51

| | | | | |

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE 60

VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120

SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180

LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL

Seq ID NO: 622 DNA sequence
Nucleic Acid Accession #: FGENESH predicted
Coding sequence: 1..390

1 11 21 31 41 51

| | | | | |

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60

TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240

TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360

AGTAAATGTG AAAGAGGCCG GATATGTTAG

Seq ID NO: 623 Protein sequence
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

| | | | | |

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FVVKLKVQGV NSQCHSSPIS 120
SKCERGRIC

Seq ID NO: 624 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 51..1085

1 11 21 31 41 51

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 625 Protein sequence
Protein Accession #: AAA59907.1 

1 11 21 31 41 51 

| | | | | |

MGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 60 

NRIGYSWYKG ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120

TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV 180

NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 240 

TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQELFIPNIT VNNSGSYMCQ 300 
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI 

Seq ID NO: 626 DNA sequence 
Nucleic Acid Accession #: M18728.1
Coding sequence: 1355..1657

1 11 21 31 41 51 

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 
TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 627 Protein sequence
Protein Accession #: AAA59908.1 

1 11 21 31 41 51

| | | | | |

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SFAWQDDAVI SISQEVASEG NLTECQIYLV 60
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDF 

Seq ID NO: 628 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 2370..2501

1 11 21 31 41 51 

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT 

Seq ID NO: 629 Protein sequence
Protein Accession #: AAA59909.1

1 11 21 31 41 51

| | | | | |

MLTNVFISVV LFPCSNLTKP TVLVLYCPGG AITVLVEWCC FNS



Seq ID NO: 630 DNA sequence 

Nucleic Acid Accession #: NM_016639.1

Coding sequence: 40..429

1 11 21 31 41 51 

| | | | | |

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 60 
CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120 
GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180 
AAGTGCATGG ACTGCGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240 
GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300
CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 
GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420 



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WO 02/086443 

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCaSACGC GGCGGOAGCC AAQCTCCTCC AACCACAAGG 540 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGQGCCCA GGOTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGOTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA 6ATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAQATA TTTATTTTGO GGAQAGTTTG GAGGGGAGGG AGAATTTATT 960 
AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 631 Protein sequence 
Protein Accession #: NP_057723.1

1 11 21 31 41 51 

| | | | | |

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 
SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE
GCPAVALIQ 

Seq ID NO: 632 DNA sequence
Nucleic Acid Accession #: NM_003816.1
Coding sequence: 79..2538



PCT/US02/12476 



CGGCAGGGTT 
CCTGCGGAAT 
CGGTGGTTGC 
CAACAGACCT 
AGAAGAGAAG 
AAAGAGCATA 
TATACTTACA 
CATTATCGGG 
GGACTCAGAG 
AGCTCTCATT 
TGTGGAGTTT 
CCCAGCATGA 
GAGCTGTTCA 
GTGAGAGAAG 
ATTCGAATTG 
GGGGGTGCTG 
CGTCGGAGAC 
ATGGCATTTG 
CAAATCACTG 
ATGAATCACG 
GGAGCATCGG 
TTAAATAAAG 
CCCTCCTGTG 
GAATGTGAAT 
TGTGCATATG 
GGAAAAACCA 
CCAGATGTTT 
GGCATGTGCC 
GCCCCCAAAG 
TTCTCTGGCA 
TGTGAGAATG 
AGTCGAGGCA 
GGGATGGTTA 
GTAGATGCTT 
GTATGTAATA 
ACTAAAGGAT 
TTGAGGGACG 
TTTATCTTCA 
ACATA1GAGT 
CGACATGTTT 
GTACCAACCT 
CCGAAAGTAT 
TATAGTTCCC 
CTAATACTTT 
GAAAACAAAA 
AGTTGTGAAA 
CATCATTGAA 
TGAACATGTT 
AGTGTTTAAG 
AACATGTGAT 
TTTTTCATCA 
CATGAATAAG 
TTATTTTGAA 
TCCATTTTTA 
GAATTTCTAT 
TAAATTATAA 
GGCTATAATA 
CTTGAGAATT 
AGAATGTTTA 
CATAGAAATT 
TTACTGTGGT 



11 
1 

GGAAAATGAT 
CGGCCGAGAT 
TGTTGCTTGG 
CACATCTTTC 
CCCCTAGGCC 
TTATTCACTT 
ACAA6GAAGG 
GCTATGTGGA 
GATTGCTGCA 
TTGAGCACAT 
CCAACAAGGA 
CTCAGCTACT 
TTGTCGTAGA 
AGATGATTCT 
TGCTAGTTGG 
GTGATGTGCT 
ATGACAGTGC 
TGGGAACAGT 
TGGAGACATT 
ATGATGGGAG 
GTTCCAGAAA 
GAGGAAACTG 
GTAATAAGTT 
TGGACCCTTG 
GTGACTGTTG 
GTGAGTGTGA 
TTATTCAGAA 
AGTATTATGA 
ATTGTTTCAT 
ATGAATACAA 
TACAAGAGAT 
CCAAATGTTG 
ACGAAGGCAC 
CTGTTCTGAA 
GCAATAAGAA 
ACGGA66AAG 
GACTTCTGGT 
TCAAGAGGGA 
CAGATGGCAA 
CTCCAGTGAC 
ATGCAGCCAA 
CATCTCAGGG 
TCACTTGATT 

CACCACAAAA 
TACAAGGAAA 
TAAGTCTTAT 
ATTGCAGTGA 
TGTTATTCT6 
AATCTAATAC 
TGCACGAATT 
CAAATATTGT 
AGTACAAAAT 
TGACCTTTCA 
TATGAATCAT 
GCTTTAAGGT 
AAGCAGGAGC 
TCATGAGCAC 
CATTTACTAA 
AGGCTGGAGA 
ATCTATGAGT 



21 
1 

GGTUVGAGGCXS 
GGGGTCTGGC 
CCTGGTGGGC 
TTCTTATGAA 
CTATTCAAAA 
GGAAAGGAAC 
GACTTTAATC 
GGGAGTTCAT 
TTTAGAGAAT 
CATTTATCGA 
TATAGAGAAA 
TCGAAGAAGA 
CAAGGAAAGG 
CCTGGCAAAC 
ACTGGAGATT 
GGGGAACTTC 
ACAGCTAGTT 
GTGTTCAAGG 
TGCTTCCATT 
AGATTGTTCC 
CTTTAGCAGT 
CCTTCTTAAT 
GGTGGACGCT 
CT6CGAAGGA 
TAAAGACTGT 
TGTTCCAGAG 
TGGATATCCT 
TGCTCAATGT 
TGAAGTGAAT 
GAAGTGTGCC 
ACCTGTATTT 
GGGTGTGGAT 
AAAATGTGGT 
TTATGACTGT 
TTGTCACTGT 
TGTGGACAGT 
CTTCTTCTTC 
TCAACTGTGG 
AAATCAAGCA 
ACCTCCCAGA 
GCAACCTCAG 
AAACTTAATT 
TTTTTAACCT 
TGATGTTTTC 
CAGACTTCAC 
TGCAGTAAAG 
TCAGTCATCG 
TTCTCAAATT 
J\ATTTTCTAC 
CTGTGAAAAC 
AATAATCATC 
CTTCJ\AAAGA 
ATACTAAAAG 
ACTATAGGTA 
GTGAAAGCAT 
ACGTUi^GTATT 
AATTATAAAA 
TTTAAAATCT 
GGTGTGCTGG 
AAGAAGGAAG 
TATCATCTTA 



31 
I 

GAGGTGGAG6 
GCGCGCTTTC 
CCAGTCCTCG 
ATTATAACTC 
C/^GTATCTT 
AAAGACCTTT 
ACTGACCATC 
AATTCATCCA 
GCGAGTTATG 
ATGGATGATG 
GAAACTGCAA 
AGAGCTGTCT 
TATGACATGA 
TACTTGGATA 
TGGACCAATG 
GTGCAGTGGC 
CTAAAGAAAG 
AGCCACGCAG 
GTTGCTCATG 
TGTGGAGCAA 
TGCAGTGCAG 
ATTCCAAAGC 
GGGGAAGAGT 
AGTACCTGTA 
CGGTTCCTTC 
TACTGCAATG 
TGCCAGAATA 
CAAGTCATCT 
TCTAAAGGTG 
ACTGGGAATG 
GGAATTGTGC 
TTCCAGCTAG 
GCTGGAAAGA 
GATGTTCAGA 
GAAAATGGCT 
GGACCTACAT 
CTAATTGTTC 
AGAAGCTACT 
AACCCTTCTA 
GAAGTTCCTA 
CAGTTCCCAT 
CCTGCCCGTC 
TCTTTTTGCA 
TTG7UVAAGCC 
TAACACAGAA 
CCAGGGAATT 
GTGAGGTTTUV 
AACTGTATTG 
CTTAGTTATC 
TGACTAATCA 
ATACTCTAGA 
ATGCACAAGA 
AGTGTGTGTG 
ATT^CTCTTA 
GACATTCGTT 
TAATAGATCT 
TCTTCAATCA 
GAACTTTCAA 
GTCAT6TAAA 
AAATGGTTTT 
GCTGTGTTAA 



41 

I 

CGACCGAGTG 
CCTCGGGGAC 
GTGCGGCGCG 
CTTGGAGATT 
ATGTTATTCA 
TGCCTGAAGA 
CCAATATACA 
TTGCTCTTAG 
GGATTGAACC 
TCTACAAAGA 
AGGATGAAGA 
TGCCACAGAC 
TGGGAAGAAA 
GTATGTATAT 
GAAACCTGAT 
GGGAAAAGTT 
GTTTTGGTGG 
GCGGGATTAA 
AATTGGGTCA 
AGAGCTGCAT 
AGGACTTTGA 
CTGATGAAGC 
GTGACTGTGG 
AGCTTAAATC 
CAGGAGGTAC 
GTTCTTCTCA 
ACAAAGCCTA 
TTGGCTCAAA 
ACAGATTTGG 

CTGCTATTAT 
GATCAGATGT 
TCTGTAGAAA 
AAAAGTGTC7V 
GGGCTCGCCC 
ACAATGAAAT 
CCCTTATTGT 
TCAGAAAGAA 
GACAGCCGGG 
TATATGCAAA 
CAAGGCCACC 
CTGCTCCTGC 
AATGTCTTCA 
TTTCTGTTGC 
AAACAGAAAC 
TACAATAACA 
TGCACTAATC 
GTGTAAGATT 
ATTAATGTAG 
GCTGCCAATA 
ATCTTGTCTG 
ACCACAATTA 
TATTCACGCA 
GAGAAATTAA 
CACAATAGCA 
AATCAAATAT 
ATTGAACTTT 
AGCTTGCTAT 
ATATTAGACA 
CTTAAATACC 
AAATGAATTT 



51 
I 

CTGAGAGGAA 
CCTTCGTGTC 
GCCAGGCTTT 
AACTAGAGAA 
GGCTGAAGGA 
TTTTGTGGTT 
GAATCATTGT 
CGACTGTTTT 
CCTGCAGAAC 
GCCTCTGAAA 
GGAAGAGCCT 
CCGGTATGTG 
TCAGACTGCT 
TATGTTAAAT 
CAACATAGTT 
TCTTATCACA 
AACTGCAGGA 
TGTGTTTGGA 
TAATCTTGGA 
CATGAATTCA 
GAAGTTAACT 
CTATAGTGCT 
TACTCCAAAG 
ATTTGCTGAG 
TTTATGCCGA 
GTTCTGTCAG 
TTGCTACAAC 
AGCCAAGGCT 
CAATTGTGGT 
AAAGCTTCAG 
TCAAACGCCT 
TCCAGATCCT 
CTTCCAGTGT 
TGGACATGGG 
AAATTGTGAG 
GAATACTGCA 
CTGTGCTATT 
GAGATCACAA 
GAGTGTTCCT 
CAGATTTGCA 
TCCACCACAA 
ACCTCCTTTA 
GGGAACTGAG 
AACTATGAAT 
TGAGTGTGAG 
TTTCCGTTTC 
ATGGATTTTT 
TTTGTCATTA 
TTCCTCATTG 
ATATCTAATA 
TCACTCACTA 
AGATGTCATA 
GTTACTOGCT 
TTTAATATTA 
CTATTTTAAA 
GTTGATTCAT 
TACAAAACCA 
TAAATCATTT 
CTAATATTTT 
TACZ^AAZ^AAG 
TTACTATGGC 



60 
120 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 







AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720 
AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 3780 
TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840 

AAAAAAAAAA AAAAAAAAAA AAAAA 






Seq ID NO: 633 Protein sequence
Protein Accession #: NP_003807.1

1 11 21 31 41 51

| | | | | |

MGSGARFPSG TLRVRWLLLL GLVGPVLGAA RPGFQQTSHL SSYEIITPWR LTRERREAPR
PYSKQVSYVI QAEGKEHIIH LERNKDLLPE DFVVYTYNKE GTLITDHPNI QNHCHYRGYV
EGVHNSSIAL SDCFGLRGLL HLENASYGIE PLQNSSHFGH IIYRMDDVYK EPLKCGVSNK
DIEKETAKDE EEEPPSMTQL LRRRRAVLPQ TRYVELFIVV DKERYDMMGR NQTAVREEMI
LLANYLDSMY IMLNIRIVLV GLEIWTNGNL INIVGGAGDV LGNFVQWREK FLITRRRHDS
AQLVLKKGFG GTAGMAFVGT VCSRSHAGGI NVFGQITVET FASIVAHELG HNLGMNHDDG
RDCSCGAKSC IMNSGASGSR NFSSCSAEDF EKLTLNKGGN CLLNIPKPDE AYSAPSCGNK
LVDAGEECDC GTPKECELDP CCEGSTCKLK SFAECAYGDC CKDCRFLPGG TLCRGKTSEC
DVPEYCNGSS QFCQPDVFIQ NGYPCQNNKA YCYNGMCQYY DAQCQVIFGS KAKAAPKDCF
IEVNSKGDRF GNCGFSGNEY KKCATGNALC GKLQCENVQE IPVFGIVPAI IQTPSRGTKC
WGVDFQLGSD VPDPGMVNEG TKCGAGKICR NFQCVDASVL NYDCDVQKKC HGHGVCNSNK
NCHCENGWAP PNCETKGYGG SVDSGPTYNE MMTALRDGLL VFPFLIVPLI VCAIFIFIKR
DQLWRSYFRK KRSQTYESDG KNQANPSRQP GSVPRHVSPV TPPREVPIYA NRFAVPTYAA
KQPQQFPSRP PPPQPKVSSQ GNLIPARPAP APPLYSSLT



Seq ID NO: 634 DNA sequence

Nucleic Acid Accession #: NM_002091.1

Coding sequence: 56..503

1 11 21 31 41 51

| | | | | |

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG
CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG
AGCX3GTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA
CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC
TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG
GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA
GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT
AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA
CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC
TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA
AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT
CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCX3AAA AGAGTCTTCC AATTAATGCT
TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA
TAAAAGCTTA AACACAT



Seq ID NO: 635 Protein sequence 
Protein Accession #: NP_002082.1 



11 



21 



31 



41 



51 



MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHI*M GKKSTGESSS 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 

Seq ID NO: 636 DNA sequence 

Nucleic Acid Accession #: NM_016522.1 

Coding sequence: 265.. 1299 



1 
1 

GCGGAAGCAG 
CTGGCAAAAG 
TTTTCTCCTC 
CCGCACCCCA 
rCGGGGAAGT 
TGCCTCGTGG 
AGCGGAGATG 
GCCACCCTCA 
ACCATCCTCT 
AACACCCAAA 
TACACCTGCT 
CAAGTATCTC 
ATTAGCCTCA 
TCTCCCAAAG 
CGGGAACAGT 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCG 
CTGCCTCTTC 
CGGGAAAGGC 
CCAATCAGAT 
GGGAGGGGAA 
CCTTGCAGAT 



11 
1 

CGA6GAGGGA 
CCGAGGCTGG 
CCCGCGCCTC 
CCCACTTCCT 
TGTGGCTGTC 
TC6TGTCTCT 
CCACCTTCCC 
GGTGCACTAT 
ATGCTGGGAA 
CGCAGTACAG 
CGGTGCAGAC 
CCAAAATTGT 
CCTGCATAGC 
CGGTTGGCTT 
CAGGGGACTA 
AGGTCACCGT 
GACAAAAGGG 
ACAAG6ATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA 
TGCCGCCACC 
ATATACAAAT 
CAT^GAATAC 
ATTTAGGTAC 



21 
I 

GCCCCCTTTG 
ATTTGGGGGA 
CCGGTCGCCG 
GTGCTCGCCC 
GAGAATGGGG 
CAGGCTGCTG 
CAAAGCTATG 
TGACAACCGG 
TGACAAGTGG 
CATCGAGATC 
AGACAACCAC 
AGAGATTTCT 
AACTGGTAGA 
TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACTCATCTTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 
TTTGGGGGGA 
AATGGAGTTT 



31 

1 

GCCGTCCTCC 
GGAATATTAG 
CGGGTTCACC 
GGGGGGCGTG 
GTCTGTGGGT 
TTCCTTGTAC 
GACAACGTGA 
GTCACCCGGG 
TGCCTGGATC 
CAGAACGTGG 
CCAAAGACCT 
TCAGATATCT 
CCAGAGCCTA 
GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGTU^GCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTCGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 



41 

! 

GTGGAACCGG 
ACTCGGAGGA 
GCTCAGTCCC 
TGCCGTGCGG 
ACCTGTTCCT 
CCACAGGAGT 
CGGTCCGGCA 
TGGCCTGGCT 
CTCGCGTGGT 
ATGTGTATGA 
CTAGGGTCCA 
CCATTAATGA 
CGGTTACTTG 
TGGAAATTCA 
ACGTGGCCGC 
CAGT^GCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 



51 
1 

TTTTCCGAGG 
6TCTGCGCGC 
CGCGCTCGCT 
CTGCCGGAGT 
GCCCTGGAAG 
GCCCGTGCGC 
GGGGGAGAGC 
A/iACCGCAGC 
CCTTCTGAGC 
CGAGGGCCCT 
CCTCATTGTG 
AGGGAACAAT 
GAGACACATC 
GGGCATCACC 
GCCCGTGGTA 
GGGTACAGGT 
CTCAGCAGAA 
GAAAGTGGAA 
CTATGGGAAC 
GCTATTTGGT 
CGTCTGGCTG 
CTTCCCCACC 
CCGACAGCAA 
AGAAATTTGA 
TTGAAAATTG 
ACACAGCACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 






WO 02/086443 

CCCGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG 
GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATTCTGGAGC TGGCCATCCC 
AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 
CCGGCCCAAG CGTGGCGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA A7VAAAAAAA 

Seq ID NO: 637 Protein sequence 
Protein Accession #: NP_057606.1 

1 11 21 31 41 51 

1 I I 1 1 1 

MGVCGYLFLP WKCLWVSLR LLFLVPTGVP VRSGDATFPK AMDNVTVRQG ESATLRCTID 

NRVTRVAWIiN RSTILYAGND KWCLDPRWL LSNTQTQYSI EIQNVDVYDE GPYTCSVQTD 

NHPKTSRVHL IVQVSPKIVB ISSDISINEG NNISLTCIAT GRPEPTVTWR HISPKAVGPV 

SEDEYLBIQG ITRBQSGDYE CSASNDVAAP WURVKVTVN YPPYISEAKG TGVPVGQKGT 
LQCEASAVPS AEFQWYKDDK RLIEGKKGVK VENRPFLSKL IPFNVSEHDY GNYTCVASNK 

LGHTNASIMtj FGPGAVSBVS NGTSRRAGCV WLLPLLVLHL LLKP 

Seq ID NO: 638 DNA sequence 

Nucleic Acid Accession #: NM_012261.1 

Coding sequence: 203.. 1045 

1 11 21 31 41 51 

I I I I I I 

GATTTGCTCr GCCAGCAGCT GTCGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 
ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 
CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 
CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 
ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 
GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 
TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 
GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 
TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 
CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 
GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 
CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 
CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 
TGATCCGCAG AAGACGGTCA CCAT6ATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 
TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TA7\ATGCCCA GTGGATGAGC GGGAGCAACT 
GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 
CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 
ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 
CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 
CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 
AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 
ATGCTGGGGA GGAGGG6AGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 
TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 
TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 
TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 
TCATGCTCCC TGCAGCAAGA CCCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 
GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCGCA TCCAGCGACT GCAGCACCAG 
AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 
GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 
TTCTCTGGC 

Seq ID NO: 639 Protein sequence 
Protein Accession #: NP_036393.1 

1 11 21 31 41 51 

I I t I I I 

MDLQGRGVPS IDRIiRVIil/MI* FHTMAQIiVIAE QEVENLSGLS TNPEKDIFW RENGTTCLMA 
EFAAKFIVPY DVWASNYVDIi ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 
FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 
ECQAQQTISL ASSDPQECTVT MIIiSAVHIQP FDIISDFVFS EBEIKCPVDER EgtiEETLPIiI 
LGIiILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHM6 

Seq ID NO: 640 DNA sequence 

Nucleic Acid Accession #: NM_002993.1 

Coding sequence: 64..408 

1 11 21 31 41 51 

1 I i I I I 

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 
ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 
GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 
GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTTiAAC 
CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 
GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 
AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 
ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 
CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 
GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 
CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 
CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 
TGAAGATAAC TATTGTATTT CTATCATACA TTCGTTAAAG TCTTACCGAA AAGGCTGTGG 
ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 
ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 
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TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTOTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGAT6TC TTCTTAGTAT GGCaTAATGT 1080 

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 

ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320 

CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTQTOGATCT 1380 

TTTAAAQGTT TTGACCATTT TOTTAT6AGO AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATSTTQOTTT TTOOTACTTG TATTSTCATT 1500 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 



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Seq ID NO: 641 Protein sequence 
Protein Accession #: NP_002984.1 

1 11 21 31 41 51 

I 1 I I I I 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 
KTIGKLQVFP AGPQCSKVEV VASbKNGKQV CIiDPEAPFLK KVIQKILDSG NKKN 



Seq ID NO: 642 DNA sequence 
Nucleic Acid Accession #: NM_013271.1 
Coding sequence: 27..809 



TCCGGAGCCA 

CCGGGGGCGT 
TCTGCGCGCG 
AGACTGGCGC 
TGCAGGAGCT 
GGGCCGAGGC 
TCTGGGGCGC 
CTGCAGCGCA 
CCCAGCTTGT 
ACGACGGCCC 
CCGAGCTGTT 
TGGCAGCCCC 
CTGAGGGCGT 
TGCCTGCACG 
CAGAAGTGCC 
TTACCCCGGC 
GATCTGAGC 



11 
1 

GGCTCGCTGG 
CGGCCTTTTG 
GCCGGTAAAG 
TCCTCGCCGC 
GGCGCGGGCG 
GCAGGAGGCT 
CCCCCGCAAC 
GCTCGCTCGC 
CCCCGCGCCC 
CGCGGGCCCG 
GAGGTACTTG 
GCGCCGCCTC 
GCTGGGGGCG 
CCGCCTCTTG 
CCCGCCATCC 
CAGCCAGCCC 



21 
1 

GGCAGCATGG 
GTGCTGCTGC 
GAACCCCGCG 
TTCCGGCGGT 
CTGGCGCATC 
GAGGATCAGC 
TCTGATCCGG 
GCTCTGCTCG 
GTCCCCGCCG 
GATGCTGAGG 
CTGGGACGGA 
CGCCGTGCCX3 
CTGCTGCGTG 
CCACCCTGAG 
CGCCACCAGG 
TCTCACCCGA 



31 
I 

CGGGGTCGCC 
TGCTCGGCCT 
GCCTAAGCGC 
CAGTGCCCCG 
TGCTGGAGGC 
AGGCGCGCGT 
CTCTGGGCCT 
GCGCCCGCCT 
CGGCGCTCCG 
AGGCAGGCGA 
TTCTTGCGGG 
CCGACCACGA 
TGAAACGCCT 
CACTGCCCGG 
ACTTCTCCCC 
GGATCCCTAC 



41 
1 

GCTGCTCTGG 
GTTTCGGCCG 
AGCGTCTCCG 
AGGTGAGGCG 
CGAACGTCAG 
CCTGGCGCAG 
GGACGACGAC 
TGACCCTGCC 
ACCCCGGCCC 
CGAGACACCC 
AAGCGCGGAC 
TGTGGGCTCT 
AGAGACCCCG 
ATCCCGTGCA 
GCCAGCACGT 
CCCCTGGCCC 



51 
I 

GGGCCGCGGG 
CCCCCCGCGC 
CCCTTGGCTG 
GCGGGGGCGG 
GAGCGGGCGC 
CTGCTGCGCG 
CCCGACGCGC 
GCCCTAGCAG 
CCGGTCTACG 
GACGTGGACC 
TCCGAGGGGG 
GAGCTGCCCC 
GCGCCCCAGG 
CCCTGGGACC 
CCAGAGCAAC 
ACAATAACAT 



Seq ID NO: 643 Protein sequence 
Protein Accession #: NP_037403.1 



11 



21 



31 



41 



51 
1 



MAGSPLLWGP RAGGVGLLVI* LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGE7WVG AVQELARALA HLLEAERQER ARAEAQBAED QQARVIAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL tiRARIiDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLG7UJL 
RVKRLETPAP QVPARRLLPP 

Seq ID NO: 644 DNA sequence 
Nucleic Acid Accession #: NM_002214 
Coding sequence: 681.. 2990 



1 
I 

CCCAGAGCCG 
CTGCCGACTT 

GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 
CACTTGTTCT 
CCTGTGCCAG 
TTTCAGGTGG 
GCTCAGTTGA 
TTAATACCCA 
ATTTTATGCT 
ATGTCTCAGC 
CTAGAAAAAT 
AAACAGTTTC 
ACAATTTAGA 
TCACTGAGTT 
AAGGAGGTTT 
AAGAGGCTAA 



11 

I 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
T^AACGACCGG 
TGGACTGGGC 
GTGCCTTGCG 
ATCAAGAAGT 
TTCAATAGAA 
GGTGACACCA 
GAAAGTTCAT 
ATCAATGCAC 
GGCATTTTTC 
ACCATACATT 
CTGCATGCCT 
TGAGAAAGCA 
TGACGCCATG 
AAGATTGCTG 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGGCCTG 
GGCCGAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 
CAAGGTGAAG 
CTGGGTCCAG 
GAACGTTGTG 
TACCCATCTG 
GGAGAAGTGT 
CCTCTGAAGA 
AATAATATAG 
TCCCGTGACT 
AGCATCCACC 
CCCCATGGAT 
GTTCATAGAC 
CTTCAGGCAG 
CTGGTGATGA 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CG6CTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 
ACAATAGATG 
AATGTGGATG 
ATATTGTTTC 
TGCATGTTAT 
CTATCCAGCT 
AATATCCTGT 
AAAAATTAAA 
TTCGTCTTGG 
CCGAAAGGAT 
ACATCCATGT 
AGAAGATCTC 
CT6TCTGTGA 
CAGATCAGAC 



41 

1 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC 
GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 
TGCATCTTCA 
GTGTGTTCAA 
CAATTTAATA 
AATACCCACT 
GCGTCCAGGA 
GGATCTTTAT 
TTCCGTTGGA 
ATTTGGCTCA 
TCATAATCAA 
GCTGTCTTTG 
TGGAAACATA 
AAGTCATATC 
GTCTCATCTC 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 
CTCGCCGGCG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 
AATGCAGCAT 
GAGGATTTCA 
AGCAAAGGCT 
GAAAATGAAA 
GCCGAAGCTA 
TATCTTGTTG 
AACGATTTAT 
TACGTTGATA 
TGCAGTGACT 
ACAGAGAACA 
GATACACCAG 
GGATGGCGAA 
GCTCTTGATA 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 162 0 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCA7VAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCGATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTArGTC GACCARACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTGATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG 6AAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 

Seq ID NO: 645 Protein sequence 
Protein Accession #: NP_002205 

1 11 21 31 41 51 

I I I I I I 

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 

LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 120 

GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 

SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEPEKA 240 

VHRQKISGNI DTPEGGPDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300 

VPNDGNCHbK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI PAVQGKQFHW YKDIiLPLLPG 360 

TIAGEIESKA ANLNNLWEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420 

NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480 

CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 

VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 

CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 

CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVIIiQ WNSNKIKSSS 720 
DYRVSASKKD KIjILQSVCTR AVTYRREKPE EIKMDISKIiN AHETFRCNF 

Seq ID NO: 646 DNA sequence 

Nucleic Acid Accession #: NM_003318.1 

Coding sequence: 1..2574 

1 11 21 31 41 51 

i I I I I I 

ATGGAATCCG AGGATTTAAG TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTG 60 

AGAGACATTA A/U^ATAAGTT TAAAAATGAA GACCTTACTG ATG/^CTAAG CTTGAATAAA 120 

ATTTCTGCTG ATACTACAGA T/W^CTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

AACCCAGAGG ACTGGTTGAG TTTGTTGCTC AAACTAGAGA AAAACAGTGT TCCGCTAAGT 240 

GATGCTCTTT TAAATAAATT GATTGGTCGT TACAGTCAAG CAATTGAAGC GCTTCCCCCA 300 

GATAAATATG GCCAAAATGA GAGTTTTGCT AGAATTCAAG TGAGATTTGC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCT^ 480 

AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 

GT^AATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 

AAGAATTTAT CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTCCGG TTCACTTGGG 660 

CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 780 

CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCTTCTAAAT 840 

AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT GAAAAGACAA 900 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 

TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC ATTTCAAGGA ACCTCTGGTG 1020 

TCAGATGAAA AGAGTTCTGA ACTTATTATT ACTGATTCAA TAACCCTGAA GAATAAAACG 1080 

GAATC7\AGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320 

ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 
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GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 
AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 

Seq ID NO: 647 Protein sequence 
Protein Accession #: NP_003309.1 

1 11 21 31 41 51 

1 I I I I I 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 
NPEDWLSLIjL KLEKNSVPLS DALIiNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 
AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180 
EIALRNLMLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNRNNSCD SRGQTTKARF 240 
LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SWPCFMKRQ 300 
TSRSECRDLV VPGSKPSGND SCEIiRNLKSV QNSHFKEPLV SDEKSSELII TDSITIiKNKT 360 
ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420 
KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCPRTPV VKNDFPPACQ 480 
LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 
QVLNEKKQIY AIKYVNLEEA DNQTI.DSYRN EIAYL.NKLQQ HSDKIIRLYD YEITDQYIYM 600 
VMECGNIDLN SWLKKKKSID PWERKSYWKN MIiEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 
KUDFGIANQ MQPDTTSWK DSQVGTVNYM PPEAIKDMSS SRENGKSKSK ISPKSDVWSL 720 
GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CIiKRDPKQRI 780 
SIPEIiLAHPY VQIQTHPVNQ MAKGTTBEMK YVLCQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKKRGKK 

Seq ID NO: 648 DNA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241.. 1902 

1 11 21 31 41 51 

I I I I i I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCX3GCGGCTT AGCTGCTACG 180 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGC7\AG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACX3GCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCX3AAGA TGT6TGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG 6TGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGJ\AGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA T7VATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAJ^A CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGA;\AAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TAT6ACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 




TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 






Seq ID NO: 649 Protein sequence 
Protein Accession #: NP_056322 






1 
I 

MPLPWSIiALP 
CEATCEPGCK 
LSGHMLMPDA 
GKVICPYNRR 
GSFKCKCKQG 
PEPTRTPTPK 
SLRGDVPFPK 
WNPADRDNAI 
FVKNSNNALA 
SGLCPDSLLS 



11 

I 

LLLSWVAGGF 
FGECVGPNKC 
TCVNSRTCAM 
CVNTFGSYYC 
YKGNGLRCSA 
VNLQPFNYEE 
VNEAGEFQIiI 
GFYMAVPALA 
WEKTTSEDEK 
VDD 



21 

I 

GNAASARHHG 
RCFPGYTGKT 
INCQYSCEDT 
KCHIGFELQY 
IPENSVKEVL 
IVSRGGNSHG 
LVQRKALTSK 
GHKKDIGRLK 
WKTGKIQLYQ 



31 
1 

LLASARQPGV 
CSQDVNECGM 
EEGPQCLCPS 
ISGRYDCIDI 
RAPGTIKDRI 
GKKGNEBKMK 
LBKKDIjNISV 
LLLPDLQPQS 
GTDATKSriF 



41 
I 

CHYGTKLACC 
KPRPCQHRCV 
SGI/RIiAPNGR 
NECTMDSHTC 
KKLLAHKNSM 
E6LEDEKREE 
DCSPNHGICD 
NFCLLFDYRL 
KAERGKGKTG 



51 

I 

YGWRRNSKGV 
NTHGSYKCFC 
DCLDIDECAS 
SHHTU^CFNTQ 
KKKAKIKNVT 
KALKNDIEER 
WKQDREDDFD 
AGDKVGKLRV 
EIAVDGVIiriV 



Seq ID NO: 650 DNA sequence 

Nucleic Acid Accession #: NM_003506.1 

Coding sequence: 259..2379 



GCAGCTCCAG 
TTAGACGGGG 
CTCATTTTCA 
ATCTTTGGAT 
ATCAGGAATT 
CTCCTAAGAG 
ATGGCCTACA 
GCGGTGGAAA 
ACTTTCCTCT 
TGTCGTAAAC 
ATCCGATGGC 
GTAACTTTTG 
AGAGACATTG 
TTTCTGGGAA 
CTAGAGTTTG 
TTCACATTCC 
ATATATTACT 
GGCGATAGCA 
CTAGGCTCTC 
GCTGGCACTG 
TGGAGTTGTG 
CCAGGTTTCC 
GGAGTTTGCT 
CTGTGCCTTT 
CATGTTCGAC 
ATTCGAATTG 
TACGTCTATG 
CGTCAGTACC 
TTATTTATGA 
GGAAGCAAT^ 
CCAATCAGTG 
TCTAAAGTTA 
TCCAAATCCA 
ATTACTAGCC 
ACATCAATGA 
TGTGGTGAAC 
GGGAAGGGCC 
AAGAGTGATA 
TCAGAACCAA 
AGAAAAGAGC 
CAGAAGCAAA 
TACGTTCTTC 
TCAAGAATAA 
AAATGTGCAG 
CCTTTTCTAT 
TTTACCTTTT 
GTATCTTTTT 
ATTCAAGTAT 
ATTTCTAAGA 
GTCTTATAAT 
TGTGATTTTT 
GGTGCTTACT 
ATATTTAAAA 
GGCCAAGTGC 
AACCACTTAC 
TACATTTTGT 



11 
I 

TCCCGGACGC 
ACGGGAAGGG 
GGAAAGCCTG 
GGGGATCTTC 
TGAAGAAAAT 
GGCACAGTCT 
ACATGACGTT 
TGGAGCATTT 
GCAAAGCATT 
TTTGTGAGAA 
CTGAGGAGCT 
ATCCACACAC 
GATTTTGGTG 
TTGACCAGTG 
CAAAAAGTTT 
TTACTTTTTT 
CTGTCTGTTA 
CAGCCTGCAA 
AAAAT7VAGGC 
TGTGGTGGGT 
AAGCCATCGA 
TGACTGTTAT 
TTGTTGGCCT 
GTGTGTTTGT 
AAGTCATACA 
GAGTCTTCAG 
AGCAAGTGAA 
ATATCCCATG 
TAAAATACCT 
AGACATGCAC 
AAAGTCGAAG 
AACACAAAAA 
TGGGAACCAG 
ATGATTACCT 
GAGAGGTGAA 
CTGCCTCGCC 
AGGCAGGCAG 
TTACTGACAC 
GCAGCCTCAA 
AGGGAGGTGG 
TTTGTGTTAC 
TTTTGCACTT 
TATGACTCAT 
GTTAATAATA 
TTATGAAGAT 
TGATATAAAA 
ATACATATTT 
TTTTATCATG 
A/UVTTGTAAA 
AGGAATTTAA 
ATAGTCTCGT 
CAAAGAGTGT 
TAAAATGTCC 
AATTGACTTC 
AGTTGCTTAT 
ATTATACAGT 



21 
1 

AACCCCGGAG 
ACAGCGGCCT 
AAAATGAGTA 
TGAGGATGCA 
GGAGATGTTT 
CTTCACCTGT 
TTTCCCTAAT 
TCTTCCTCTC 
TGTACCAACC 
AGTATATTCT 
TGAATGTGAC 
AGAATTTCTT 
TCCAAGGCAT 
TGCGCCTCCA 
TATTGGAACA 
AATTGATGTT 
CAGCATTGTA 
TAAGGCAGAT 
TTGCACCGTT 
GATTCTTACC 
GCAAAAAGCA 
GCTTCTTGCT 
TTATGACCTG 
TGGGCTCTCT 
ACATGATGGC 
CGGCTTGTAT 
CAGGATTACC 
TCCTTATCAG 
GATGACATTA 
AGAATGGGCT 
AGTACTACAG 
GAAGCACTAT 
CACAGGAGCT 
AGGACAAGAA 
AGCGGACGGA 
AGCAGCATCC 
TGTATCTGAA 
TGGCCTGGCA 
AGGTTCCACA 
TTGTCATTCA 
ACTGGAAGTG 
AAAGTTGCAT 
TTCACACAAA 
TTTTTTTAAT 
TCTACTCTTG 
TCAAGATATT 
GAAAATAAGC 
CTATTGTGAT 
ATAGTCTTCT 
CTTTAAAAAC 
TTTAGGAATT 
CCACTATTGA 
TAAAGGGTTA 
CCTTTTTTAA 
ATTTTTTGTT 
ACCTTTCTCA 



31 
I 

CCGTCTCAGG 
TCGACCGCCC 
AAATAGTGAA 
AAGAGTGATT 
ACATTTTTGT 
GAACCAATTA 
CTGATGGGTC 
GCAAATCTGG 
TGCATAGAAC 
GATTGCAAAA 
AGATTACAAT 
GGTCCTCAGA 
CTTAAGACTT 
TGCCCCAACA 
GTTTCAATAT 
AGAAGATTCA 
TCTCTTATGT 
GAGAAGCTAG 
TTGTTCATGC 
ATTACTTGGT 
GTGTGGTTTC 
CTGAACAAAG 
GATGCTTCTC 
CTTCTTTTAG 
CGGAACCAAG 
CTTGTGCCAT 
TGGGAGATAA 
GCAAAAGCAA 
ATTGTTGGCA 
GGGTTTTTTA 
GAATCATGTG 
AAACCAAGTT 
ACAGCAAATC 
ACTTTGACAG 
GCTAGCACCC 
ATCTCCAGAC 
AGTGCGCGGA 
CAGAGCT^CA 
TCTCTGCTTG 
GATACTTGAA 
ACCTATGCAC 
TGCCTACTGT 
GGTTAATGAC 
AGTGTGGGAG 
GTAAGAGTAT 
TCTTTGCTGA 
TTATATGTAT 
ATTTTAGCAC 
TTTATACTGT 
CCACTTATTG 
TCACAGATCT 
TTGTATTATG 
GTAGACAAAA 
TGTTTCATGA 
TTAACTTTTG 
GACATTTTGT 



41 

I 

TCCCTGGGGG 
CCCGAGTAAT 
ATGAGGAATT 
CATCCAAGCC 
TGACGTGTAT 
CTGTTCCCAG 
ATTATGACCA 
AATGTTCACC 
AAATTCATGT 
AATTAATTGA 
ACTGTGATGA 
AGAAAACAGA 
CTGGGGGACA 
TGTATTTTAA 
TTTGTCTTTG 
GATACCCAGA 
ACTTCATTGG 
AACTTGGTGA 
TTTTGTATTT 
TCTTAGCTGC 
ATGCTGTTGC 
TTGAAGGAGA 
GCTACTTTGT 
CTGGCATTAT 
AAAAACTAAA 
TAGTGACACT 
CTTGGGTCTC 
AAGCTCGACC 
TCTCTGCTGT 
AACGAAATCG 
AGTTTTTCTT 
CACACAAGCT 
ATGGCACTTC 
AAATCCAAAC 
CCAGGTTAAG 
TCTCTGGGGA 
GTGAAGGAAG 
ATTTGCAGGT 
TTCACCCAGT 
GAACATTTTC 
TGTTTTGTAA 
TATACTGGAA 
AACAATATAC 
GACAGAGTTA 
TTTAAGATGT 
AGTATTTAAA 
TTGAACTTTT 
TTTGGTAGCT 
AAAAAAAGAT 
ATACCTTACC 
AAATTATGTA 
CTGCTCACTG 
TGTTAGTCTT 
CCACCCATTG 
TTTCTTAACA 
AG 



51 
I 

GAACGGTGGG 
TGACCCAGGA 
TGAACATTTT 
ATGTGGTAAA 
TTTTCTACCC 
ATGTATGAAA 
GAGTATTGCC 
AAACATTGAA 
GGTTCCACCT 
CACTTTTGGG 
GACTGTTCCT 
ACAAGTCCAA 
AGGATATAAG 
AAGTGATGAG 
TGCAACTCTG 
GAGACCAATT 
ATTTTTGCTG 
CACTGTTGTC 
TTTCACAAT6 
AGGAAGAAAA 
ATGGGGAACA 
CAACATTAGT 
ACTCTTGCCA 
TTCCTTAT^T 
GAAATTTATG 
TCTCGGATGT 
TGATCATTGT 
AG.AATTGGCT 
CTTCTGGGTT 
CAAGAGAGAT 
AAAGCACAAT 
GAAGGTCATT 
TGCAGTAGCA 
CTCACCAGAA 
AGAACAGGAC 
ACAGGTCGAC 
GATTAGTCCA 
CCCCAGTTCT 
TTCAGGAGTG 
TCTCGTTACT 
GAATCACTGT 
AAAATAGAGT 
CTGAAAACAG 
GAGGAATCTT 
ACTATGCTAT 
TCTTATCCTT 
TTGAAATCCT 
TTTACACTGA 
ATACCAAAAA 
ATCTAAAATG 
ACTGAAATAA 
ATCCTTCTGC 
TTGTATATTA 
ATTGTATTAT 
TTTAGAATAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



Seq ID NO: 651 Protein sequence 
Protein Accession #: NP_003497.1 



21 
I 



51 

I 




MEMFTFLLTC IFIiPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60 

PLPLANLECS PNIBTFLCKA FVPTCIEQIH WPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120 

LECDRLQYCD ETVPVTFDPH TEPLGPQKKT EQVQRDIGFW CPRHIjKTSGG QGYKFLGIDQ 180 

CAPPCPNMYF KSDBLEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240 

YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMIiLY FFTMAGTVWW 300 

VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLAIjNKVEG DNISGVCPVG 360 

LYDLDASRYF VLLPLCLCVF VGIiShLLAGX ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420 

SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PEL7VLFMIKY 480 

LMTLIVGISA VFWVGSKKTC TBWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 

KKHYKPSSHK IiKVISKSMGT STGAT/iNHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 

KADGASTPRL REQDCGEPAS PAASISRLSG EQVD6KGQAG SVSESARSE6 RISPKSDITD 660 
TGIiAQSMNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 






Seq ID NO: 652 DNA sequence 
Nucleic Acid Accession #: NM_014791.1 
Coding sequence: 171.. 2126 



TTGGCGGGCG 
GCCGCGTCTC 
TCCGCCCCTC 
ATGATGAACT 
AGGTCAAACT 
AAAACACACT 
TGAGACATCA 
TGGTTCTTGA 
TGTCAGAAGA 
ACAGCCAGGG 
ATAAATTAAA 
ATCTACAGAC 
CATATCTTGG 
GTGGATTTCT 
GAAAATATGA 
TGCAGGTGGA 
TGCAAGATTA 
ATGATTGCGT 
TAATTTCACT 
AGGCTCGGGG 
CTACCCCATT 
ATAAAAATTA 
GTGCTGCTAC 
AATCTAAATC 
AAAATGTATA 
CAAAGACTCC 
ACACTACACC 
TAAATTCAAC 
GCTCAGTGGA 
CCAAAGTGTT 
GCAAAAGGAA 
CTACAACTAG 
AGAAGCATGT 
TTGGGAAAGT 
TGGGTATCAG 
ACATCCTATC 
GGTGTGATAC 
CTACCAACTT 
GATATTATTT 
TAATCATGTG 
AATGTAAGCT 
TTGTGAATAT 



11 

I 

GAAGCGGCCA 
TCAGGACAGC 
AGGTTCTTTT 
TCTCAAATAT 
TGCCTGCCAT 
AGGGAGTGAT 
GCATATATGT 
GTACTGCCCT 
GGAGACCCGG 
CTATGCTCAC 
GCTGATTGAC 
ATGCTGTGGG 
ATCAGAGGCA 
ACCATTTGAT 
TGTTCCCAAG 
CCCAAAGAAA 
CAACTATCCT 
AACAGAACTT 
GTGGCAGTAT 
AAAACCAGTT 
CACAGACATC 
TGTGGCGGGA 
TCCCCGAACA 
ATTAACTCCA 
TACTCCTAAG 
AGTTAATAAG 
CTCAAAAGCT 
AGGAACAGAC 
ATTGGATCTC 
TGGGAGCCTT 
GGGTTCTGCC 
ATTAGTGAAT 
TGACTTTGTA 
GACAATGCAA 
GAGGCAGCGG 
TAGCTGCAAG 
AGCCTACATA 
GTTTCTAAAG 
TGTGTATGAA 
GTTTTGTATA 
CTTAACTATG 



21 

I 

CAACCCGGCG 
AGGCCCCTGT 
TCTAATTCCA 
TATGAATTAC 
ATCCTTACTG 
TTGCCCCGGA 
CAACTCTACC 
GGAGGAGAGC 
GTTGTCTTCC 
AGGGACCTCA 
TTTGGTCTCT 
AGTCTGGCTT 
GATGTTTGGA 
GATGATAATG 
TGGCTCTCTC 
CGGATTTCTA 
GTTGAGTGGC 
TCTGTACATC 
GATCACCTCA 
CGTTTAAGGC 
AAGTCAAATA 
TTAATAGACT 
TCACAGTTTA 
GCCTTATGCA 
TCTGCTGTAA 
AACCAGCATA 
AGAAACCAGT 
AAGTTAATGA 
AACCAAGCAC 
GAAAGGGGGT 
AGAGACGGGC 
CCAGATCAAC 
CAAAAGGGTT 
TTTGAATTAG 
CTTAAGGGCG 
GTATAATT6A 
AAGACTGTTA 
AGCTATCTTA 
TCTAAATCAA 
TTAATAATTG 
TCTCTTTGTA 



31 
I 

ATCGAAAAGA 
CCTTCTGTCG 
AATAAACTTG 

ATGAAACTAT 
GAGAGATGGT 
TCAAAACGGA 
ATGTGCTAGA 
TGTTTGACTA 
GTCAGATAGT 
AGCCAGAAAA 
GTGCAAAACC 
ATGCAGCACC 
GCATGGGCAT 

TAATGGcrrr 

CCAGTAGCAT 
TGAAAAATCT 
AAAGCAAGAA 
ACAGAAACAA 
CX3GCTACCTA 
TTTCTTCTTT 
ATTGGAGTCT 
ATGATTGGTG 
CCAAGTACTG 
GAACACCTGC 
AGAATGAAGA 
AGAGAGAAAT 
GCCTGAAAGA 
CAGGTGTCAT 
ATATGGAGGA 
TGGATAAGGT 
CCAGAAGACT 
TGTTGAATGA 
ATACACTGAA 
AAGTGTGCCA 
ATGCCTGGGT 
TGGATTCTTC 
TGATCGCTTT 
AGACCAATAT 
GCCCATCTGT 
TTGACTTTCT 
ATGTGTAATT 



41 
I 

TTCTTAGGAA 
GGCGCC6CTC 
CAAGAGGACT 
TGGGACAGGT 
AGCTATAAAA 
GATTGAGGCC 
GACAGCCAAC 
TATAATTTCC 
ATCTGCTGTT 
TTTGCTGTTT 
CAAGGGTAAC 
TGAGTTAATA 
ACTGTTATAT 
ATACAAGAAG 
TCTGCTTCTT 
ATTGAACCAT 
TCCTTTTATT 
CAGGCAAACA 
TCTTCTGCTT 
CTCCTGTGGA 
GGAAGATGTG 
TGAAGATGAT 
GACAGAATCA 
AAATAAATTA 
GTACTTTATG 
ACTCACTACG 
AACTCCAATT 
TAGCCCTGAG 
GACTCCAAAA 
TATCACTGTG 
AAAGCTTCAC 
AATAATGTCT 
GTGTCAAACA 
GCTTCAAAAA 
TTACAAAAGA 
CATCCTGCCG 
GATTTTAAAG 
CTCTTTGTTT 
CATTATGTTA 
TAGATTCACT 
TCTTTCTGAA 



51 

I 

CGCCGTACCA 
AGCCGTGCCC 
ATGAAAGATT 
GGCTTTGCAA 
ATCATGGATA 
TTGAAGAACC 
AAAATATTCA 
CAGGATCGCC 
GCTTATGTGC 
GATGAATATC 
AAGGATTACC 
CAAGGCAAAT 
GTTCTTATGT 
ATTATGAGAG 
CAACAAATGC 
CCCTGGATCA 
CACCTCGATG 
ATGGAGGATT 
CTAGCCAAGA 
CAAGCCAGTG 
ACCGCAAGTG 
TTATCAACAG 
AATGGGGTGG 
AAGAACAAAG 
TTTCCTGAGC 
CCAAATCGTT 
AAAATACCAG 
AGGCGGTGCC 
AGAAAGGGAG 
CTCACCAGGA 
TATAATGTGA 
ATTCTTCCAA 
CAGTCAGATT 
CCCGATGTGG 
TTAGTGGAAG 
GATGAGTGTG 
TTCATTGGAA 
TTAAACAAAA 
CTGTCTTTTT 
TCCATATGTG 
ATATVAACCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 






Seq ID NO: 653 Protein sequence 
Protein Accession #: NP_055606.1



11 



21 



31 



41 



51 






MKDYDELLKY
LKNLRHQHIC
AYVHSQGYAH 
QGKSYLGSEA 
QQMLQVDPKK 
MEDLISLWQY 
TASDKNYVAG 
KNKENVYTPK 
KIPVNSTGTD 
LTRSKRKGSA 
QSDFGKVTMQ 



YELHETIGTG 
QLYHVLETAN 
RDLKPENLLF 
DVWSMGILLY 
RISMKNLLNH 
DHLTATYLLL 
LIDYDWCEDD 
SAVKNEEYPM 
KLMTGVISPE 
RDGPRRLKLH 
FELEVCQLQK 



GFAKVKLACH 

KIFMVLEYCP 
DEYHKLKLID 
VLMCGFLPFD 
PWIMQDYNYP 
LAKKARGKPV 
IiSTGAATPRT 
FPEPKTPVNK 
RRCRSVELDIi 
YNVTTTRLVN 
PDWGIRRQR 



ILTGEMVAIK 
GGELFDYIIS 
FGLCAKPKGN 
DDNVMALYKK 
VEWQSKNPFI 
RLRLSSFSCG 
SQFTKYWTES 
NQHKREILTT 
NQAHMEETPK 
PDQLIiNEIMS 
LKGDAWVYKR 



IMDKNTLGSD 
QDRLSEEETR 
KDYHLQTCCG 
IMRGKYDVPK 
HLDDDCVTEL 
QASATPFTDI 
NGVESKSIiTP 
PNRYTTPSKA 
RKGAKVPGSL 
ILPKKHVDFV 
LVEDILSSCK 



LPRIKTEIEA 
WFRQIVSAV 
SLAYAAPELI 
WLSPSSILLL 
SVHHRNNRQT 
KSNNWSIiEDV 
ALCRTFANKIi 
RNQCLKETPI 
ERGLDKVITV 
QKGYTLKCQT 
V 



Seq ID NO: 654 DNA sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 88..990

1 11 21 31 41 51 

I I 1 I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
WO 02/086443

GGCATCACCT GTGCCATACC 
CTTTACAACA AATACCCAGA 
CAGAATCTCC TAGCCCCACA 
GATGATATGG ATGATGAAGA 
AACGACTCTG ATGATGTAGA 
TCTGATGAAT CTGATGAACT 
TTCACTCCAG TTGTCCCCAC 
GGACTGAGGT CAAAATCTAA 
GACGAGGACA TCACCTCACA 
CCCGTTGCCC AGGACCTGAA 
GAAACGAGTC AGCTGGATGA 
TATAAGCGGA AAGCCAATGA 
CTTTCCAAAG TCAGCCGTGA 
GTTGTAGACC CCAAAAGTAA 
TTAGATAGTG CATCTTCTGA 
ATTTAGTCAA AAGAAAAAAT 
CTCAGTTTAT TGGTTGAATG 
ATTAGTTTAG TTTGTGGCTT 
CTATGTTCAT TCTATAGAAG 
TCATGAATAG AAATTTATGT 
ATAACATTTT ATGTCACTAT 
TATCTTTTTG TGGTGTGAAT 
AATTGCTTAT TTGTTTTCCC 
GCCTAAAAAA AAAAAAAAAA 






AGTTAAACAG 
TGCTGTGGCC 
GACCCTTCCA 
TGATGATGAC 
TGACACTGAT 
GGTCACTGAT 
AGTAGACACA 
GAAGTTTCGC 
CATGGAAAGC 
CGCGCCTTCT 
CCAGAGTGCT 
TGAGAGCAAT 
ATTCCACAGC 
GGAAGAAGAT 
GGTCAATTAA 
GCTTTATAGC 
TGTATCTATT 
CATGGAAACT 
AAATGCAT^C 
AGAAGCAAAC 
AATCTTTTGT 
AAATCTTTTA 



AAAA 



GCTGATTCTG 
ACATGGCTAA 
AGTAAGTCCA 
CATGTGGACA 
GATTCTCACC 
TTTCCCACGG 
TATGATGGCC 
AGACCTGACA 
GAGGAGTTGA 
GATTGGGACA 
GAAACCCACA 
GAGCATTCCG 
CATGAATTTC 
AAACACCTGA 
AAGGAGAAAA 
AAAATGAAAG 
TGAGTCTGGA 
CCCTGTAAAC 
TATCACTGTA 
AAAATACTTT 
TTTTTAAGTT 
TCTTGAATGT 
AGCAATTAAT 



GAAGTTCTGA 
ACCCTGACCC 
ACGAAAGCCA 
GCCAGGACTC 
AGTCTGATGA 
ACCTGCCAGC 
GA6GTGATAG 
TCCAGTACCC 
ATGGTGCATA 
GCCGTGGGAA 
GCCACAAGCA 
ATGTGATTGA 
ACAGCCATGA 
AATTTCGTAT 
AATACAATTT 
AGAACATGAA 
AATAACTAAT 
TAAAAGCTTC 
TTTTAATATT 
TACCCACTTA 
AGTGTATATT 
AATAAGAATT 
AAAACATAAC 



Seq ID NO: 655 Protein sequence 
Protein Accession #: NP_000573



MRIAVICFCL 
PSKSNESHDH 
DFPTDLPATE 
SEBLNGAYKA 
NEHSDVIDSQ 



11 
I 

LGITCAIPVK 
MDDMDDEDDD 
VFTPWPrVD 
IPVAQDLNAP 

ELSKVSREFH 



21 
I 

QADSGSSEEK 
DHVDSQDSID 
TYDGRGDSW 
SDWDSRGKDS 
SHEFHSHEDM 



31 
I 

QLYNKYPDAV 
SNDSDDVDDT 

YGhRSKSKKF 
YETSQLDDQS 
LWDPKSKEE 



41 
I 

ATWLNPDPSQ 
DDSHQSDESH 

RRPDIQYPDA 
AETHSHKQSR 
DKHLKFRISH 



GGAAAAGCAG 
ATCTCAGAAG 
TGACCACATG 
CATTGACTCG 
GTCTCACCAT 
AACCGAAGTT 
TGTGGTTTAT 
TGATGCTACA 
CAAGGCCATC 
GGACAGTTAT 
GTCCAGATTA 
TAGTCAGGAA 
AGATATGCTG 
TTCTCATGAA 
CTCACTTTGC 
ATGCTTCTTT 
GTGTTTGATA 
AGGGTTATGT 
TGTTATTCTC 
AAAAGAGAAT 
TTGTTGTGAT 
TGGTGGTGTC 
CTTTTTTACT 



51 
I 

KQNLIiAPQTL 
HSDESDELVT 

TDEDITSHME 
LYKRKANDES 
ELOSASSEVN 



Seq ID NO: 656 DNA sequence 

Nucleic Acid Accession #: NM_003108.1

Coding sequence: 76..1401



GGGGTGGGAG 
GCCCTGCAAC 
CGGGAGGCGC 
GAGAGCGACC 
TTCATGGTAT 
AACGCCGAGA 
ATCCCGTTCA 
TACAAGTACC 
CAGAGCCCAG 
GGTGCCAAGA 
GCGGGCGCCA 
GACGACTACG 
ACG6TCAAGT 
CAGCTGCAGA 
CTGCAGCCGC 
CCCGCCAGCC 
GAGGTGCGGG 
AAGAACATCA 
TCGCGCTCGG 
GAGGACGCCG 
GCCAGCGAGC 
GATAAGGATT 
TACTGCACGC 
GACCTG6TGT 
AGCTGGGTTC 
ATGATGGTGG 
ATATTGATAA 
TTAAAGTGAA 
TCCTTTATCG 
AAAATGTGTT 
GAGGGGGCGG 
GTCGGTCTTT 
TCTAGGGAGT 
TTTTTAACAA 



11 
I 

GGGGAGGGGG 
GGATCATGGT 
TGGACACGGA 
CAGACTGGTG 
GGTCCAAGAT 
TCTCCAAGAG 
TCCGGGAGGC 
GGCCCCGGAA 
AGAAGAGCGC 
CCTCCAAGGG 
AGGCGGGCGC 
TGCTGGGCAQ 
GCGTGTTTCT 
TCAAACAGGA 
CGGGGCAGCA 
CTACGCTGAG 
CCGGCGCGAC 
CCAAGCAGCA 
T6TCCACCTC 
ACGACCTGAT 
AGCAGCTGGG 
TGGATTCGTT 
CGGAGCTGAG 
TCACATATTG 
CTTGGGAGGA 
TGTTGATGGT 
GATGTCGTGA 
ATGAGTAGTT 
TGTCTCAAGG 
TTTGTAATTA 
CGCGGCGGAG 
GAAGTCTGGA 
TGGTGGAGAT 
AAAAAGG6 



21 
1 

ACCTCCGCAC 
GCAGCAGGCG 
GGAGGGCGAA 
CAAGACGGCG 
CGAACGCAGG 
GCTGGGCAAG 
GGAGCGGCTG 
AAAGCCCAAA 
GGCCGGCGGC 
CTCCAGCAAG 
GGGCAAGGCG 
CCTGCGCGTG 
GGATGAGGAC 
GCCGGACGAG 
GCCGTCGCAG 
CAGCTCGGCG 
CTCGGGCGCC 
CCCGCCGCCG 
CTCGTCCAGC 
GTTCGACCTG 
GGGCGGCGCG 
CAGCGAGGGC 
CGAGATGATC 
AAAGGCGCCC 
AGTTGTAGTG 
GGCGGTGGTA 
CGCAAAGAAA 
TTTAAACATT 
TAGTTGCATA 
CTATTTCTTT 
GGGAGGTAGG 
A6ACGTCTGC 



31 

I 

GAGACCCAGC 
GAGAGCTTGG 
TTCATGGCTT 
TCGGGCCACA 
AAGATCATGG 
CGCTGGAAAA 
CGGCTCAAGC 
ATGGACCCCT 
GGCGGCGGGA 
AAATGCGGCA 
GCCCAGTCCG 
AGCGGCTCGG 
GAOSACGACG 
GAGGACGAGG 
CTGCTGAGAC 
GAGTCCCCCG 
GGGGGCGGCA 
CTCGCGCAGC 
AGCAGCGGCA 
AGCTTGAATT 
GCGGCCGGGA 
AGCCTGGGCT 
GCGGGGGACT 
GCTGCTCGCT 
GT6ATGATGA 
GGGTGGAGGG 
TTGGAAAACA 
TTTCCTGTCC 
CCTAGTCTGG 
TTCCTGAAAT 



AGAGGACCCT 
CTTAAGAGAA 



41 

I 

GGCCCGGGTT 
AAGCGGAGAG 
GCAGCCCGGT 
TCAAGCGGCC 
AGCAGTCTCC 
TGCTGAAGGA 
ACATGGCCGA 
CGGCCAAGCC 
GCGCGGGCGG 
AGCTCAAGGC 
GGGACTACGG 
GCGGCGGCGG 
ACGACGACGA 
AACCACCGCA 
GCTACAACGT 
AGGGAGCGAG 
GCCGCCTCTA 
CCGCGCTGTC 
GCAGCAGCGG 
TCTCTCAAAG 
ACCTGTCCCT 
CCCACTTCGA 
GGGTGGAGGG 
CTTTCTCTCG 
TGAT6ATGAT 
GAGAGAAGAA 
TGATGAAAAT 
TTTTTTTGTC 
AGTTGTGATT 
TCGTGATTGC 
GAAGGCGCTG 
TTTGGCAGCA 
CTTAAAGAAC 



51 
I 

GGAGCGTCCA 
CAACCTGCCC 
GGCCCTGGAC 
GATGAACGCG 
GGACATGCAC 
CAGCGAGAAG 
CTACCCCGAC 
CAGCGCCAGC 
AGGCGCGGGC 
CCCCGCGGCC 
GGGCGCGGGC 
CGCGGGCAA6 
CGACGAGCTG 
CCAGCAGCTC 
CGCCAAAGTG 
CCTCTACGAC 
CTACAGCTTC 
GCCCGCGTCC 
CAGCAGCGGC 
CGCGCACAGC 
GTCGCTGGTG 
GTTCCCCGAC 
GAACTTCTCC 
GAGGGTGCAG 
AATGATGATG 
GATGCTGATG 
TTTGGTGGAG 
CCCCCTCCCT 
ATTTTCCCAA 
AACATUSLGGCA 
TTTGAAGCTT 
CAACTGTTAC 
TGGTGATTTT 



Seq ID NO: 657 Protein sequence
Protein Accession #: NP_003099.1 

1 11 21 31 41 51

I I I I I I 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPMNAFMVWS
KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP
RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA
GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK



180 
240 
300 

360 
420 
480 
540 
600 
660 
720
780 
840 
900 
960 
1020 
1080 
1140 
1260
1380
60
120 
180 
240 



60 
60

QEPDEEDEEP PHQQLLQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 300
ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGEDADD 360
LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEFPDYCTPE 420
LSEMIAGDWL EANFSDLVFT Y

Seq ID NO: 658 DNA sequence 
Nucleic Acid Accession #: NM_001719 
Coding sequence: 123..1418






GGGCGCAGCG 
CTGCCACCTG 
CX3ATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATGTTCAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AGCACTTGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCG 
AGAACAGCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAAC6 



11 

1 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGG 
ACGCTTCGAC 
CAGGGAATCG 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 

) 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCGG 
GCCCTGGCCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA 
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAGCTGTCA 
AATGAGACGT 
GATCTCTTCC 
GACATCACAG 
CTCTCGGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCG 
CTGAACTCCT 
AACCCGGAAA 
CTCTACTTCG 
GCCTGTGGCT 
ATCCTCCATT 
CCCTCCCTAT 
TTTT6ATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAGGAAAA 



31 

I 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCGGCGGGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CX3GCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
A6AC6CTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCGCAACTTT 
TTTTTCAGTG 
GG7^A7W^7\AA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



Seq ID NO: 659 Protein sequence 
Protein Accession #: NP_001710 



MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETF 
HNLGLQLSVE 
QNRSKTPKNQ 
GECAFPLNSY 
RNMWRACGC 



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNI4V 
RISVYQVLQE 
TI/DGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 

1 

LFliLRSALTU^ 
MFMLDLYNAM 

EHDKEFPHPR 
HLGRESDLFL 
liAGIilGRHGP 
NSSSDQRQAC 
TLVHFIMPBT 



31 
I 

PSLDNEVHSS 
AVEEGGGPGG 
YHHREFRFDL 
LDSRTLWASE 
QNKQPFMVAF 
KKHELYVSFR 
VPKPCCAPTQ 



41 
1 

GGGACGGCCG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CPiCGGGGAGC 
GCGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAACGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



41 
I 

FIHRRLRSQB 
QGPSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
FKATEVHPRS 
DLGWQDWIIA 
LNAISVLYPD 



51 

I 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGGCG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
GCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA
GGCAAGGGGT 
ATAAATGTCA 



51 

I 

RREMQREILS 
FSTQGPPLAS 
AAEFRIYKDY 
TSNHWWNPR 
IRSTGSKQRS 
PEGYAAYYCE 
DSSNVILKKY 



Seq ID NO: 660 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 211..1895



GGATCTGAGG 
GGGGCTTGGG 
6AGGAATTAT 
TGATTTTTTT 
GTGCTTTTTC 
CACAGGTTCC 
CTTGTGCTGA 
GAAGGT7\ATT 
AAAATATCGG 
TTCCGACACT 
GCCAATTATT 
TTCTTTGAAC 
GCTGTGGCTA 
CACATGCACT 
GTAGTCCATG 
CAAAATTCCA 
GTTGTGATGT 
TACCTGCATA 
ATCTTGATAG 
ACTCTGGCTG 
GCACCGATCT 
CTAGCTACCA 
AAACTGGCCA 



11 

I 

GGCGCCCAGT 
AGGCAGCCTG 
CTGATAAAAT 
CCCTCGAAAA 
TTTTCTCTTC 
TTGAACAGCT 
AAGCGAAAGT 
GTTTCCCTGA 
CTGTTCCATG 
GTAACCCCAA 
CAGACTGCCT 
GCCTCTATGT 
TTCTCATCAT 
TATTTGTGTC 
CTCACATAGG 
TTGAGGCAAC 
TTATTTACTT 
ATCTCATCTT 
GCTGGGGGTT 
ATGCGAGGTG 
TAGCAGCTAT 
AAATCTGGGA 
AATCGACACT 



21 

I 

CACTTCCTCC 
CTCTCCAGTC 
TCCTGGGTTA 
TGACCTTTTT 
TTTTTCTACG 
GGATTCTGAT 
ACAATGTGAA 
ATGGGATGGA 
CCCTCCTTAT 
TGGAACATGG 
TCGCTTTCTG 
AATGTATACC 
TGGTTACTTC 
TTTCATGCTG 
A6TAAAGGAG 
TTCTGTGGAC 
CCTGGCTACA 
TGTGGCTTTC 
TCCAGCAGCA 
CTGGGAACTT 
TGGGCTGAAT 
GACCAATGCA 
GGTCCTGGTC 



31 

1 

ACGTTCTCGT 
CCTATCCACC 
ATATTTTTAA 
ATGCTTCGAA 
ATAAATGAAA 
GGCACCATTA 
CTCAACATCA 
CTCATTTGTT 
ATTTATGACT 
GATTTTATGC 
CAGCCAGATA 
GTTGGCTACT 
AGACGATTGC 
AGAGCTACAA 
CTGGA6TCCC 
AAATCACAAT 
AATTATTATT 
TTTTCGGACA 
TTTGTTGCAG 
AGTGCTGGAG 
TTTATTCTGT 
GTTGGGCATG 
CTAGTCTTTG 



41 
I 

GCTGGGCGGG 
CACAGGTTTT 
AAACGGAGAG 
GCAGTTTGTC 
GCATTTCTTC 
CTATAGAGGA 
CAGCTCAACT 
GGCCCAGAGG 
TCAACCATAA 
ACAGCTTAAA 
TCAGCATAGG 
CCATCTCTTT 
ATTGCACTAG 
GCATCTTTGT 
TAATAATGCA 
ATATCGGGTG 
GGATCCTGGT 
CCAAATACCT 
CATGGGCTGT 
ACATCAAGTG 
TTCTGAATAC 
ACACAAGGAA 
GAGTGCATTA 



51 

I 

AGGAGCGGAT 
TTGGGTCGGA 
TTTTTAAAAA 
AACCAGCATA 
AAGAAAAAGG 
GCAGATTGTC 
CCAGGAGGGA 
AACAGTGGGG 
AGGAGTTGCT 
TAAAACATGG 
AAAGCT^GAA 
TGGTTCCTTG 
GAACTATATC 
CAAAGACAGA 
GGATGACCCA 
CAAGATTGCT 
GGAAGGTCTC 
GTGGGGCTTC 
GGCACGAGCA 
GATTTATCAA 
GGTTAGAGTT 
GCAATACAGG 
CATCGTGTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720
780 
900
960 
1020 
1200
180
60
300
360 
420 
480 
540 
600 

GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 
TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG
GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG
ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 
AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 
AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 
TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG
CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA

Seq ID NO: 661 Protein sequence
Protein Accession #: Eos sequence 






MLRSSI/STSI 
LNITAQLQEG 
DFMHSLNKTW 
RRLHCTRNYI 
KSQYIGCKIA 
FVAAWAVARA 
VGHDTRKQYR 
SIIYCYCNGE 
RMVLISGKAA 
SRPMESNPDT 



11 
I 

VlfFLFSSFST 
EGNCFPEWDG 
ANYSDCLRFL 
HMHLFVSFML 
WMFIYFLAT 
TLADARCWEL 
KLAKSTLVLV 
VQAEVKKMWS 
KIASRQPDSH 
EGCQGETEDV 



21 
1 

INESISSRKR 
LICWPRGTVG 
QPDISIGKQE 
RATSIFVKDR 
NYYWIIiVEGL 
SAGDIKWIYQ 
LVFGVHYIVF 
RWNLSVDWKR 
ITLPGYVWSN 
L 



31 
I 

HRFiEQLDSD 
KISAVPCPPY 
FFERLYVMYT 
WHAHIGVKE 
YLHNLIFVAF 
APILAAIGLN 
VCLPHSFTGL 
TPPCGSRRCG 
SEQDCIiPHSF 



41 

I 

GTITIEEQIV 
lYDFNHKGVA 
VGYSISFGSL 
XiBSZiZMQODP 
FSDTKYLWGF 
FILFLNTVRV 
GWEIRMHCEL 
SVIiTTVTHST 
HEETKEDSGR 



51 

! 

LVLKAKVQCE 
FRHCNPNGTW 
AVAILIIGYF 
QNSIEATSVD 
ILIGWGFPAA 
LATKIWETNA 
FFNSFQGFFV 
SSQSQVAAST 
QGDDILMEKP 



Seq ID NO: 662 DNA sequence
Nucleic Acid Accession #: NM_005048
Coding sequence: 143..1795



GGCCGGTGGC 
TGGGCCAGCC 
TCTTCCTACA 
GCTAATGCTC 
TATAGAGGAG 
AGCTCAACTC 
GCCCAGAGGA 
CAACCATAAA 
CAGCTTAAAT 
CAGCATAGGA 
CATCTCTTTT 
TTGCACTAGG 
CATCTTTGTC 
AATAATGCAG 
TATCGGGTGC 
GATCCTGGTG 
CAAATACCTG 
ATGGGCTGTG 
CATC7VAGTGG 
TCTGAATACG 
CACAAGGAAG 
AGTGCATTAC 
CC6CAT6CAC 
CTGCTACTGC 
CTCCGTGGAC 
CACCGTGACG 
TATCTCTGGC 
TGGCTATGTC 
CAAGGAAGAT 
GGAATCTAAC 
CATTTGTGGC 
ATACTCCTAT 
TTTTAGGCTC 
GGAGTAGTTT 
GCTCTGTGAT 
GCTGTAGCTT 
ATTTTCCTTT 
ATTTATTtTG 
GATCTAAGAA 
TTATAACAAT 
ACATCCCTTC 
TTCTTTGTAA 
TTGATTTTGT 
TTTGAGCTGT 
T 



11 

1 

CCGGGCCCGA 
AAGTTGGCAA 
GCCGTTCCGG 
GGCAGCTGCC 
CAGATTGTCC 
CAGGAG6GAG 
ACAGTGGGGA 
GGAGTTGCTT 
AAAACATGGG 
AAGCAAGAAT 
GGTTCCTTGG 
AACTATATCC 
AAAGACAGAG 
GATGACCCAC 
AAGATTGCTG 
GAAGGTCTCT 
TGGGGCTTCA 
GCAC6A6CAA 
ATTTATCAAG 
GTTAGAGTTC 
CAATACAGGA 
ATCGTGTTCG 
TGTGAGCTCT 
AATGGAGAGG 
TGGAAAAGGA 
CACAGCACCA 
AAAGCTGCCA 
TGGAGTAACT 
AGTGGGAGGC 
CCAGACACTG 
TGACTTTCAT 
GCTTGAGTTC 
CATGAATTGG 
ATTACCTTCT 
TGTTCATTTT 
TCTCTCATAT 
TAGAAACTAG 
CCTGTGCATA 
CAAGTACTTG 
TACATGTGTT 
TTTTGAATGG 
ACCATGTCAT 
TTGTAATGTA 
TACTACATTG 



21 

1 

CCACCCCAGC 
CTTGGAAGCT 
GCATGGCCGG 
TCCTGGCCAG 
TTGTGCTGAA 
AAGGTAATTG 
AAATATCGGC 
TCCGACACTG 
CCAATTATTC 
TCTTTGAACG 
CTGTGGCTAT 
ACATGCACTT 
TAGTCCATGC 
AAAATTCCAT 
TTGTGATGTT 
ACCTGCATAA 
TCTTGATAGG 
CTCTGGCTGA 
CACCX3ATCTT 
TAGCTACCAA 
AACTGGCCAA 
TATGCCTGCC 
TCTTCAACTC 
TTCAGGCAGA 
CACCGCCATG 
GCAGCCAGTC 
AGATCGCCAG 
CAGAGCAGGA 
AGGGAGATGA 
AAG6ATGCCA 
GGGCTGGTCC 
AAAGGCTGAA 
CTCCTGTAAA 
ATTGGCATCA 
TTTCTGCTAC 
ATATCACCCT 
TATTCTCTTA 
GGAGCAATTA 
CTGGAAAATT 
TTTGGGAACA 
CCTCTTTGTG 
GTGGAAAGAT 
TTTTGATAGC 
TACATGGCAT 



31 
I 

TGCGCGTCGT 
TCTCCCGGGC 
GCTGGGGGCG 
AGCCCAGCTG 
AGCGAAAGTA
TTTCCCTGAA 
TGTTCCATGC 
TAACCCCAAT 
AGACTGCCTT 
CCTCTATGTA 
TCTCATC7VTT 
ATTTGTGTCT 
TCACATAGGA 
TGAGGCAACT 
TATTTACTTC 
TCTCATCTTT 
CTGGGGGTTT 
TGCGAGGTGC 
AGCAGCTATT 
T^TCTGGGAG 
ATCGACACTG 
TCACTCCTTC 
CTTTCAGGGT 
GGT6AAGAAG 
TGGCAGCCGC 
ACAGGTGGCG 
CAGACAGCCT 
CTGCCTGCCA 
TATTCTAATG 
AGGAGAAACT 
AATG6CTGGT 
AATTCAGTTA 
TACTAACGAC 
AGTTTTCCTC 
TTTTGGGTAG 
AAATATAATG 
TTTCTTACTT 
GGATCTAAAA 
AGTTGGCTGG 
AGGAAAATTT 
ACCAGCCAGA 
TTCCTCAGTT 
AAATCATGCT 
GTGGGATCAA 



41 

t 

TACTGGCCAC 
TCTGGAGGAG 
TCGCTCCACG 
GATTCTGATG 
CAATGTGAAC 
TGGGATGGAC 
CCTCCTTATA 
GGAACATGGG 
CGCTTTCTGC 
ATGTATACCG 
GGTTACTTCA 
TTCATGCTGA 
GTAAAGGAGC 
TCTGTGGACA 
CTGGCTACAA 
GTGGCTTTCT 
CCAGCAGCAT 
TGGGAACTTA 
GGGCTGAATT 
ACCAATGCAG 
GTCCTGGTCC 
ACTGGGCTCG 
TTCTTTGTGT 
ATGTGGAGTC 
AGATGCGGCT 
GCCAGCACAC 
GACAGCCACA 
CACTCTTTCC 
GAGAAGCCTT 
GAGGATGTTC 
TGTGTGAGAG 
AGGTGTTACT 
ATGAAAATGC 
TAAATTAATG 
AAAAAAGATT 
AAGATCTTTT 
TAATGTACTT 
AAATATATGG 
ACATTGATAA 
CTCAAAAAAG 
CCTCAGGTCT 
AGTGAGCTTG 
GCATCTATAT 
TTAAAAATTT 



Seq ID NO: 663 Protein sequence 
Protein Accession #: NP_005039 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



51 
I 

AAGTTTGCTC 
GGTCCCTGCT 
TCTGGGGTTG 
GCACCATTAC 
TCAACATCAC 
TCATTTGTTG 
TTTATGACTT 
ATTTTATGCA 
AGCCAGATAT 
TTGGCTACTC 
GACGATTGCA 
GAGCTACAAG 
TGGAGTCCCT 
AATCACAATA 
ATTATTATTG 
TTTCGGACAC 
TTGTTGCAGC 
GTGCTGGAGA 
TTATTCTGTT 
TTGGGCATGA 
TAGTCTTTGG 
GGTGGGAGAT 
CTATCATCTA 
GGTGGAATCT 
CAGTGCTCAC 
GCATGGTGCT 
TCACTTTACC 
ACGAGGAGAC 
CCAGGCCTAT 
TCTGAATGGA 
GGCTTGGCTG 
TAATAATAGT 
AAGTGTCAAT 
TATGGTATTT 
CAATTGCTTG 
AGTGTGTATC 
CTATCACTGC 
GAAGATAAAA 
AATAATGCAT 
AATATTTCAC 
TCACTCTTTC 
TGTCTGCAT^ 
CTTTTTCTTG 
GTTTTAAAAA 



60 
120
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



1 11 21 31 41 51 

I 1 I I I I 

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60

GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120

NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 180

MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240




VMFIYFLATN YYWILVEGLY LHNLIFVAFF SDTKYLWGFI LIGWGFPAAF VAAWAVARAT 300

LADARCWELS AGDIKWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 360

LAKSTLVLVL VFGVHYIVFV CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 420

QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 480

IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 540

GCQGETEDVL



Seq ID NO: 664 DNA sequence 
Nucleic Acid Accession #: NM_012152 
Coding sequence: 43..1104



1 
I 

CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 

1 

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
CAATCATGAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTTCAC 
TTATAATAGG 

TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAA6 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 
I 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAA/^CAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 
TCTTCCCTGG 
ATGGCCTTCC 
AGCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



Seq ID NO: 665 Protein sequence 
Protein Accession #: NP_036284



MNECHYDKHM 
FHFPFYYLLA 
LVIAVERHMS 
PIYSRSYLVF 
VMTVLGAFW 
YGTMKKMICC 



11 

I ■ 

DFFYNRSNTD 
NLAAADFFAG 
IMRMRVHSNL 
WTVSNLMAFL 
CWTPGLWLL 
FSQENPERRP 



21 
I 

TVDDWTGTKL 
lAYVFLMFNT 
TKKRVTliLIL 
IMVWYLRIY 
IiDGLNCRQCG 
SRIPSTVLSR 



31 
I 

VIVLCVGTFF 
GPVSKTLTVN 
LVWAIAIFMG 
VYVKRKTNVIi 
VQHVKRWFLL 
SDTGSQYIED 



41 

I 

CAATGAATGA 
ATACTGTCGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCX3CTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
OSGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



41 

1 

CLFIFFSNSL 
RWFLRQGLLD 
AVPTLGWNCIi 
SPHTSGSISR 
IiALLNSWNP 
SISQGAVCNK 



51 
1 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCX3TCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGTCTTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



51 
I 

VIAAVIKNRK 
SSLTASLTNL 
CNISACSSLA 
RRTPMKLMKT 
IIYSYKDEDM 
STS 



Seq ID NO: 666 DNA sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: 150..3362



AACTCCCGCC 
GCGCTCCGGT 
CCTCAGCTCC 
CCCGCCGGTT 
CCATTGTCTT 
TTCGCTGTGA 
CTGTCCAGGA 
ACCGGCTGCA 
AAGCCCGCAG 
TGAAGCATCC 
ACATTGATGG 
ATGGTCAGAG 
GTCCTGAGCA 
GCAGCCAGAA 
CCCAGGACGT 
AGCCACCCCC 
GCCCCCCACA 
TCCGGCCACG 
TCATCCTGGA 
GGGTGTTTAC 
AGCCCAGCGT 
AGAAGGGCCA 
GCCACGCGGC 
TGCCCTCCTG 
TGGATTGCCT 
TCATCTCAGA 
TGGAGGTGTA 
AGGCGCAAGC 
AGCAGTGCAT 
AGCCCACTAT 
ACGCTGGGAC 
TTGCCTCCAA 
TTATCACCTT 
TGCAGTGCGA 
TCCTGGACCC 



11 

I 

TCGGGACGCC 
GCGTCCGCCT 
TTTTCCTGAG 
GCCTCTGCTC 
CATCAAGCAG 
GGTTGAGGCT 
CACGGAGCGG 
GGACTCTGGC 
TGCCAACGCC 
AGCCTCGGAA 
GCACCCTCGG 
CAACCACACA 
TAGTGGGCTG 
CTTCACCTTG 
GGTAGTAGCG 
GAGCCTGCAG 
CCTCCGCAGA 
CAATGCAGGG 
AGCCACACTT 
AGCTGGCAGC 
GTGGTGGGAG 
CGAGCTGGTG 
CAACCTGGCT 
GCTGAAGAAG 
GACCCAGGCC 
GGACTCACGG 
TGATGGGACA 
CCGTGTCCAA 
GGAGTTTGAC 
TAAGTGGGAA 
CCTGCATTTT 
CGGGCCGCAG 
CAAAGTGGAA 
GGCCCAGGGG 
CACCAAGCTG 



21 

I 

TCGGGGTCGG 
CCTGTGCCCG 
CCCGCCGCGA 
AGCGTCCTGC 
CCGTCCTCCC 
CCGGGCCCGG 
CGTTTCGCCC 
ACCTTCCAGT 
TCCTTCAACA 
GCTGAGATCC 
CCCACCTACC 
GTCAGCAGCA 
TATTCCTGCT 
AGCATTGCTG 
AGGTAT6AGG 
TGGCTCTTTG 
GCCACAGTGT 
ATCTACCGCT 
CACCTAGCAG 
GAGGAGCGTG 
CACGCGGGAG 
TTGGCCAATA 
GGTCAGCGGA 
CCCCAAGACA 
ACACCAAAAC 
TTCGAGGTCT 
TGGTACCGTT 
GTGCTGGAAA 
AAGGAGGCCA 
CGGGCAGATG 
GCCCGGGTGA 
GGCCAGATTC 
CCAGAGCGTA 
GACCCCAAGC 
GGACCCAGGA 



31 
I 

GCTCCGGCTG 
CCGCGGAGCA 
TGGGAGCTGC 
TGCTGGCGCT 
AGGATGCACT 
TACATGTGTA 
AGGGCAGCAG 
GTGTGGCTCG 
TCAAATGGAT 
AGCCACAGAC 
AATGGTTCCG 
AGGAGCGGAA 
GCGCCCACAG 
ATGAAAGCTT 
AGGCCATGTT 
AGGATGAGAC 
TTGCCAACGG 
GCATTGGCCA 
AGATTGAAGA 
TGACCTGCCT 
TCCGGCTGCC 
TTGCTGAAAG 
GACAGGATGT 
GCCAGCTGGA 
CTACAGTTGT 
TCAAGAATGG 
GTATGAGCAG 
AGCTCAAGTT 
CGGTGCCCTG 
GGAGCAGCCT 
CTCGAGATGA 
GTGCCCATGT 
CGACTGTGTA 
CGCTGATTCA 
TGCACATCTT 



41 

1 

CGGCTGCTGC 
GTCTGCGGCC 
GCGGGGATCC 
GCTGGGCGGT 
GCAGGGGCGC 
CTGGCTGCTC 
CCTGAGCTTT 
GGATGATGTC 
TGAGGCAGGT 
CCAGGTCACA 
AGATGGGACC 
CCTGACGCTC 
TGCTTTTGGC 
TGCCAGGGTG 
CCATTGCCAG 
TCCCATCACT 
GTCTCTGCTG 
GGGGCAGAGG 
CATGCCGCTA 
TCCCCCCAAG 
CACCCATGGC 
TGATGCTGGT 
CAACATCACT 
GGAGGGCAAA 
CTGGTACAGA 
GACCTTGCGC 
CACCCCAGCC 
CACACCACCA 
TTCAGCCACA 
CCCAGAGTGG 
CGCTGGCAAC 
CCAGCTCACT 
CCAGGGCCAC 
GTGGAAAGGC 
CCAGAATGGC 



51 
I 

TGCGGCGCCC 
CGCCGTGCGC 
CCGGCCAGAC 
ACCCAGACAG 
CGGGCGCTGC 
GATGGGGCCC 
GCAGCTGTGG 
ACTGGAGAAG 
CCTGTGGTCC 
CTTCGTTGCC 
CCCCTTTCTG 
CGGCCAGCTG 
CAGGCTTGCA 
GTGCTGGCAC 
TTCTCAGCCC 
AACCGCAGTC 
CTGACCCAGG 
GGCCCACCCA 
TTTGAGCCAC 
GGTCTGCCAG 
AGGGTCTACC 
GTCTACACCT 
GTGGCCACTG 
CCCGGCTACT 
AACCAGATGC
ATCAACAGCG 
GGCAGCATCG 
CCCCAGCCAC 
GGCCGAGAGA 
GTGACAGACA 
TACACTTGCA 
GTGGCAGTTT 
ACAGCCCTAC 
AAGGACC6CA 
TCCCTGGTGA 



60
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
600
1380
1680
1740 
1800 
1860 
1920 
1980 
2040 
2100 




TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC 
ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG 
AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG 
CCGCTGTGGC CTACATCATT GCCGTGCTCG GCCTCATGTT CTACTGCAAG
AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA 
GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG 
GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT 
TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT 
TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA 
GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG 
GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG
ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG
AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG 
GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG 
TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT 
TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG 
GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT 
CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT 
CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT
GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC 
GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC 
GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA 
CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT 
TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA 
GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT 
CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT 
TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT
AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT 
AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG 
ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA 
CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG 
CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT 
TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG 
GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT 
TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 



Seq ID NO: 667 Protein sequence
Protein Accession #: NP_002812



MGAARGSPAR 
VHVYWLLDGA 
IKWIEAGPW 
KERNLTLRPA 
EAMPHCQFSA 
CIGQGQRGPP 
VRLPTHGRVY 
SQIiEEGKPGY 
CMSSTPAGSI 
GSSLPEWVTD 
TTVYQGHTAL 
RYTCIAGNSC 
GLMFYCKKRC 
KRHSTSDKMH 
LDFRRELEMP 
PLSTKQKVAL 
YHFRQAWVPL 
AGKARLPQPE 



11 

I 

PRRIiPLLSVL 
PVQDTERRFA 
LKHPASEAEX 
GPEHSGLYSC 
QPPPSLQWLF 
IILBATLHLA 
QKGHELVLAN 
LDCLTQATPK 
EAQARVQVLE 
NAGTLHFARV 
LQCEAQGDPK 
NIKHTEAPLY 
KAKRLQKQPE 
FPRSSLQPIT 
GKLNHANWR 
CTQVALGMEH 
RWMSPEAILE 
GCPSKLYRLM 



21 
I 

LLPLLGGTQT 
QGSSIiSFAAV 
QPQTQVTLRC 
CAHSAPGQAC 
EDETPITNRS 
EIEDMPLFEP 
lAESDAGVYT 
PTWWYRNQM 
KLKFTPPPQP 
TRDDAGNYTC 
PLIQWKGKDR 
WDKPVPEES 
GEEPEMECLN 
TLGKSEFGEV 
LLGLCREAEP 
LSNNRPVHKD 
GDFSTKSDVW 
QRCWALSPKD 



31 
I 

AIVFIKQPSS 
DRLQDSGTFQ 
HIDGHPRPTY 

SSQNFTLSIA 
RPPHLRRATV 
RVFTAGSEER 
CHAANIjAGQR 
LISEDSRFEV 
QQCMEFDKEA 
IASNGPQGQI 
ILDPTKLGPR 
EGPGSPPPYK 
GGPLQNGQPS 
FLAKAQGLEE 
HYMVLEYVDL 
LAARNCLVSA 
AFGVLMWEVF 
RPSFSEIASA 



41 
I 

QDALQGRRAL 
CVARDDVTGE 
QWFRDGTPLS 
DESFARWLA 
FANGShhhTQ 
VTCLPPKGLP 
RQDVNITVAT 
FKNGTLRINS 
TVPCSATGRE 
RAHVQLTVAV 
MHIFQNGSLV 
MXQTXGhSVG 
AEIQEEVALT 
GVAETLVLVK 
GDLKQFLRIS 
QRQVKVSALG 
THGEMPHGGQ 
LGDSTVDSKP 



AACAGCTGCA 
GAGGAGTCGG 
TCGGTGGGTG 
AAGCGCTGCA 
TGCCTCAACG 
GCCTTGACCA 
AAGATGCACT 
GGGGAGGTGT 
CTTGTGAAGA 
GAGATGTTTG 
GCTGA6CCCC 
AGGATTTCCA 
GTGGCCCTAT 
CATAAGGACT 
GCCCTGGGCC 
GTGCCGCTGC 
GATGTCTGGG 
GGTGGGCAGG 
CAGCCCGAGG 
CCCAAGGACC 
AGCAAGCCGT 
GGGAAGCTCA 
GCAACAGGCA 
TCCTTTGGGA 
CCCCTGCCAC 
GGCCTTCAAC 
GGGGAGGGCT 
CTGGGCACAC 
ATTATAGAGG 
CCCACGTCTT 
TTTTCAGGAG 
TATATGTAAT 
AGGAGGGTGG 
TTTTTGTTTT 



51 
I 

liRCEVEAPGP 
EARSANASFN 
DGQSNHTVSS 
PQDVWARYE 
VRPRNAGIYR 
EPSVWWEHAG 
VPSWLKKPQD 
VEVYDGTWYR 
KPTIKWERAD 
PITFKVEPER 
IHDVAPEDSG 
AAVAYIIAVIi 
SLGSGPAATN 
SLQTKDEQQQ 
KSKDEKLKSQ 
LSKDVYNSEY 
ADDEVIiADLQ 



Seq ID NO: 668 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1389



1 
1 

ATGGGCTACC 
ACCCTTGTTT 
GTTGTCAACT 
GGGTTTCCTT 
GTTTTATTGA 
AAAACTTTCG 
ATAGCAATGA 
ATCCCAGGAG 
ACAGTTACCT 
TCCCTCATCT 
TCACTGGGTC 
ATTCAAGCGG 
TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAGGTAATTG 
ACAGTGATGG 
GTTCTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 



11 

1 

AGAGGCAGGA 
CTGAACATGA 
CGATTATAGG 
TGGGAATATT 
TAAAAGGAGG 
GCTTTCCAGG 
TAAGTTACAA 
TTGATCCTGA 
TTACTCTGCC 
CTACAGGTTT 
CACACATACC 
TCGGGGTTAT 
TAGAAGAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
TCAATGGTGT 
AACTGTCTGA 
TTGGTGCTGT 



21 
I 

GCCTGTCATC 
GTATAAAGAG 
ATCTGGTATA 
GCTTTTATTC 
GGCCCTCTCT 
GTATCTGCTC 
TATAATAGCT 
AAACGTGTTT 
TTTATCCTTG 
AACAACTCTG 
AAAAACAGAA 
GTCTTTTGCA 
CACAGTAGCT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 



31 
I 

CCGCCGCAGA 
AAAACCTGTC 
ATAGGATTGC 
TGGGTTTCAT 
GGAACAGATA 
CTCTCTGTTC 
GGAGATACTT 
ATTGGTCGCC 
TACCGAAATA 
ATTCTTGGAA 
GACGCTTGGG 
TTTATTTGCC 
AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCCG 
TTTGGATTCQ 



41 

I 

GAGATTTAGA 
AGTCTGCTGC 
CTTATTCAAT 
ATGTTACGGA 
CCTACCAGTC 
TTCAGTTTTT 
TGAGCAAAGT 
ACTTCATTAT 
TAGCAAAGCT 
TTGTAATGGC 
TATTTGCAAA 
ACCATAACTC 
GCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 



51 

I 

TGACAGAGAA 
TCTTTTTAAT 
GAAGCAAGCT 
CTTTTCCCTT 
TTTGGTCAAT 
GTATCCTTTT 
TTTTCAAAGA 
TGGACTTTCC 
TGGAAAGGTC 
AAGGGCAATT 
GCCCAATGCC 
CTTCTTAGTT 
TATGTCCATC 
ATTTACTGGC 
AACATTTGGA 
TGTGACAAGA 
CATTGTTGTA 
CCTCGGGATA 
TCCATCAGCC 
GTCTTGTGTC 
TACAAATACT 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 






60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 




CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA
AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC
TTTCAATGA 

Seq ID NO: 669 Protein sequence
Protein Accession #: Eos sequence



1320 
1380






1 
I 

MGYQRQEPVI 
GFPLGILLLF 
lAMISYNIIA 
SLISTGLTTL 
YSSLEEPTVA 
RFCYGVTVIL 
VLELNGVtiCA 
QDCTHGQEMF 



11 

1 

PPQRDLDDRE 
WVSYVTDFSL 
GDTLSKVFQR 
ILGIVMARAI 
KWSRLIHMSI 
TYPMECFVTR 
TPLIFIIPSA 
YCFPDNFSLT 



21 

1 

TLVSEHEYKE 
VLLIKGGALS 
IPGVDPENVF 
SLGPHIPKTE 
VISVFICIFF 
EVIANVFFGG 
CYLKLSEEPR 
NTSESHVQQT 



31 

1 

KTCQSAALFN 
GTDTYQSLVN 
IGRHFIIGLS 
DAWVFAKPNA 
ATCGYLTFTG 
NLSSVFHIVV 
THSDKIMSCV 
TQLSTLNISI 



41 

I 

WNSIIGSGI 
KTFGFPGYLL 
TVTFTLPLSL 
IQAVGVMSFA 
FTQGDLFENY 
TVMVITVATL 
MLPIGAVVMV 
FQ 



Seq ID NO: 670 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..1284 



ATGGGCTACC 
AAGCAAGCTG 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAGAA 
GGACTTTCCA 
GGAAAGGTCT 
AGGGCAATTT 
CCCAATGCCA 
TTCTTAGTTT 
ATGTCCATCG 
TTTACTGGCT 
ACATTTGGAA 
GTGACAAGAG 
ATTGTTGTAA 
CTCGGGATAG 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCTCTCACAA 
ATTAGTATCT 



11 
I 

AGAGGCAGGA 
GGTTTCCTTT 
TTTTATTGAT 
AAACTTTCGG 
TAGCAATGAT 
TCCCAGGAGT 
CAGTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGGT 
ACAGTTCTCT 
TGATTTCTGT 
TCACCCAAGG 
GATTTTGTTA 
AGGTAATTGC 
CAGTGATGGT 
TTCTAGAACT 
GTTATCTGAA 
TGCTTCCCAT 
AAGACTGCAC 
ATACCTCAGA 
TTCAACTCGA 



21 
1 

GCCTGTCATC 
GGGAATATTG 
AAAAGGAGGG 
CTTTCCAGGG 
AAGTTACAAT 
TGATCCTGAA 
TACTCTGCCT 
TACAGGTTTA 
ACACATACCA 
CGGGGTTATG 
AGAAGAACCC 
ATTTATCTGT 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
CATCACTGTA 
CAATGGTGTG 
ACTGTCTGAA 
TGGTGCTGTG 
CCATGGGCAG 
GTCTCATGTT 
GTAA 



31 
I 

CCGCCGCAGA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCTG 
AACGTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTGCAT 
ACAGTAGCTA 
ATATTCTTTG 
GAAAATTACT 
GTCATTTTGA 
TTTGGTGGGA 
GCCACGCTTG 
CTCTGTGCAA 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



41 

1 

GAGGATTGCC 
GGGTTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GAGATACTTT 
TTGGTCGCCA 
ACCGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AGTGGTCCCG 
CTACATGTGG 
GCAGAAATGA 
CATACCCTAT 
ATCTTTCATC 
TGTCATTGCT 
CTCCCCTCAT 
CACACTCCGA 
TTGGATTCGT 
ACTGCTTTCC 
CACAACTTTC 



Seq ID NO: 671 Protein sequence 
Protein Accession #: Eos sequence 



MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTPTLP 
PNAIQAVGVM 
FTGFTQGDLF 
IWTVMVITV 
SCVMLPIGAV 
ISIFQLE 



11 

I 

PPQRGLPYSM 
YLLIiSVLQFD 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFGFVMAI 



21 

I 

KQAGFPLGIIj 
YPFIAMISYN 
GKVSLISTGL 
FLVYSSLEEP 
TFGRFCYGVT 
LGIVLELNGV 
TNTQDCTHGQ 



31 
I 

LLFWVSYVTD 
IIAGDTXiSKV 
TTLILGIVMA 
TVAKWSRLIH 

VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



41 

1 

FSLVLLIKGG 
FQRIPGVDPB 
RAISLGPHIP 
MSIVISVFIC 
VTREVIANVF 
PSACYLKLSE 
SIiTNTSESHV 



51 

1 

IGLPYSMKQA 
LSVLQFLYPF 
YRNIAKLGKV 
FICHHNSFLV 
CRNDDLVTFG 
VSLLIDCLGI 
FGFVMAITNT 



51 
I 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTG 
GAGCAAAGTT 
CTTCATTATT 
AGCAAAGCTT 
TGTAATGGCA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
ATACTTGACA 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GATTGATTGC 
TTTTATCATT 
TAAGATTATG 
CATGGCTATT 
TGACAATTTC 
TACTTTAAAT 



51 
I 

TUCiSGTDTYQS 
NVFIGRHFII 
KTEDAWVFAK 
IFFATCGYLT 
PGGNLSSVFH 
EPRTHSDKIM 
QQTTQLSTLN 



Seq ID NO: 672 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 



1 
I 

ATGGGCTACC 
AAAGGAGGGG 
TTTCCAGGGT 
AGTTACAATA 
GATCCTGAAA 
ACTCTGCCTT 
ACAGGTTTAA 
CACATACCAA 
GGGGTTATGT 
GAAGAACCCA 
TTTATCTGTA 
GACTTATTTG 
GGTGTCACTG 
AATGTGTTTT 
ATCACTGTAG 
AATGGTGTGC 
CTGTCTGAAG 
GGTGCTGTGG 
CATGGGCAGG 
TCTCATGTTC 



11 
I 

AGAGGCAGGA 
CCCTCTCTGG 
ATCTGCTCCT 
TAATAGCTGG 
ACGTGTTTAT 
TATCCTTGTA 
CAACTCTGAT 
AAACAGAAGA 
CTTTTGCATT 
CAGTAGCTAA 
TATTCTTTGC 
AAAATTACTG 
TCATTTTGAC 
TTGGTGGGAA 
CCACGCTTGT 
TCTGTGCAAC 
AACCAAGGAC 
TGATGGTTTT 
AAATGTTCTA 
AGCAGACAAC 



21 
I 

GCCTGTCATC 
AACAGATACC 
CTCTGTTCTT 
AGATACTTTG 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
CGCTTGGGTA 
TATTTGCCAC 
GTGGTCCCGC 
TACATGTGGA 
CAGAAATGAT 
ATACCCTATG 
TCTTTCATCG 
GTCATTGCTG 
TCCCCTCATT 
ACACTCCGAT 
TGGATTCGTC 
CTGCTTTCCT 
ACAACTTTCT 



31 

I 

CCGCCGCAGT 
TACCAGTCTT 
CAGTTTTTGT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTG 
GTAATGGCAA 
TTTGCAAAGC 
CATAACTCCT 
CTTATCCATA 
TACTTGACAT 
GACCTGGTAA 
GAATGCTTTG 
GTTTTCCACA 
ATTGATTGCC 
TTTATCATTC 
AAGATTATGT 
ATGGCTATTA 
GACAATTTCT 
ACTTTAAATA 



41 
I 

TTTCCCTTGT 
TGGTCAATAA 
ATCCTTTTAT 
TTCAAAGAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 
TCTTAGTTTA 
TGTCCATCGT 
TTACTGGCTT 
CATTTGGAAG 
TGACAAGAGA 
TTGTTGTAAC 
TCGGGATAGT 
CATCAGCCTG 
CTTGTGTCAT 
CAAATACTCA 
CTCTCACAAA 
TTAGTATCTT 



51 

1 

TTTATTGATA 
AACTTTCGGC 
AGCAATGATA 
CCCAGGAGTT 
AGTTACCTTT 
CCTCATCTCT 
ACTGGGTCCA 
TCAAGCGGTC 
CAGTTCTCTA 
GATTTCTGTA 
CACCCAAGGG 
ATTTTGTTAT 
GGTAATTGCC 
AGTGATGGTC 
TCTAGAACTC 
TTATCTGAAA 
GCTTCCCATT 
AGACTGCACC 
TACCTCAGAG 
TCAACTCGAG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 






TAA 






Seq ID NO: 673 Protein sequence 
Protein Accession #: Eos sequence 






1 

I 

MGYQRQEPVI 
SYNIIAGDTL 
TGLTTLILGI 
EEPTVAKWSR 
GVTVILTYPM 
NGVLCATPLI 
HGQEMFYCFP 



11 
I 

PPQFSLVLLI 
SKVFQRIPGV 
VMARAISLGP 
LIHMSIVISV 
ECFVTREVIA 
FIIPSACYLK 
DNFSLTNTSE 



21 
I 

KGGALSGTDT 
DPENVFIGRH 
HIPKTEDAWV 
FICIFFATCG 
NVFFGGNLSS 
LSEEPRTHSD 
SHVQQTTQLS 



31 

1 

YQSLVNKTFG 
FIIGLSTVTF 
FAKPNAIQAV 
YLTFTGFTQG 
VFHIVVTVMV 
KIMSCVMLPI 
TLNISIFQLE 



41 
I 

FPGYLLLSVL 
TLPLSLYRNI 
GVMSFAFICH 
DLFENYCRND 
ITVATLVSLL 
GAVVMVFGFV 



51 
I 

QFLYPFIAMI 
AKLGKVSLIS 
HNSFLVYSSL 
DLVTFGRFCY 
IDCLGIVLEL 
MAITNTQDCT 



Seq ID NO: 674 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1140 



11 



21 



31 



41 



51 



ATGGGCTACC 
CCAGGGTATC 
TACAATATAA 
CCTGAAAACG 
CTGCCTTTAT 
GGTTTAACAA 
ATACCAAAAA 
GTTATGTCTT 
GAACCCACAG 
ATCTGTATAT 
TTATTTGAAA 
GTCACTGTCA 
GTGTTTTTTG 
ACTGTAGCCA 
GGTGTGCTCT 
TCTGAAGAAC 
GCTGTGGTGA 
GGGCAGGAAA 
CATGTTCAGC 



AGAGGCAGGA 
TGCTCCTCTC 
TAGCTGGAGA 
TGTTTATTGG 
CCTTGTACCG 
CTCTGATTCT 
CAGAAGACGC 
TTGCATTTAT 
TAGCTAAGTG 
TCTTTGCTAC 
ATTACTGCAG 
TTTTGACATA 
GTGGGAATCT 
CGCTTGTGTC 
GTGCAACTCC 
CAAGGACACA 
TGGTTTTTGG 
TGTTCTACTG 
AGACAACACA 



GCCTGTCATC 
TGTTCTTCAG 
TACTTTGAGC 
TCGCCACTTC 
TUVATATAGCA 
TGGAATTGTA 
TTGGGTATTT 
TTGCCACCAT 
GTCCCGCCTT 
ATGTGGATAC 
AAATGATGAC 
CCCTATGGAA 
TTCATCGGTT 
ATTGCTGATT 
CCTCATTTTT 
CTCCGATAAG 
ATTCGTCATG 
CTTTCCTGAC 
ACTTTCTACT 



CCGCCGCAGG 
TTTTTGTATC 
AAAGTTTTTC 
ATTATTGGAC 
AAGCTTGGAA 
ATGGCAAGGG 
GCAAAGCCCA 
AACTCCTTCT 
ATCCATATGT 
TTGACATTTA 
CTGGTAACAT 
TGCTTTGTGA 
TTCCACATTG 
GATTGCCTCG 
ATCATTCCAT 
ATTATGTCTT 
GCTATTACAA 
AATTTCTCTC 
TTAAATATTA 



TCAATAAAAC 
CTTTTATAGC 
AAAGAATCCC 
TTTCCACAGT 
AGGTCTCCCT 
CAATTTCACT 
ATGCCATTCA 
TAGTTTACAG 
CCATCGTGAT 
CTGGCTTCAC 
TTGGAAGATT 
CAAGAGAGGT 
TTGTAACAGT 
GGATAGTTCT 
CAGCCTGTTA 
GTGTCATGCT 
ATACTCAAGA 
TCACAAATAC 
GTATCTTTCA 



TTTCGGCTTT 
AATGATAAGT 
AGGAGTTGAT 
TACCTTTACT 
CATCTCTACA 
GGGTCCACAC 
AGCGGTCGGG 
TTCTCTAGAA 
TTCTGTATTT 
CCAAGGGGAC 
TTGTTATGGT 
AATTGCCAAT 
GATGGTCATC 
AGAACTCAAT 
TCTGAAACTG 
TCCCATTGGT 
CTGCACCCAT 
CTCAGAGTCT 
ACTCGAGTAA 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 675 Protein sequence 
Protein Accession #: Eos sequence 






11 



21 



31 



41 



51 



MGYQRQEPVI 
PENVFIGRHF 
IPKTEDAWVF 
ICIFFATCGY 
VFFGGNLSSV 
SEEPRTHSDK 
HVQQTTQLST 



PPQVNKTFGF 
IIGLSTVTFT 
AKPNAIQAVG 
LTFTGFTQGD 
FHIVVTVMVI 
IMSCVMLPIG 
LNISIFQLE 



PGYLLLSVLQ 
LPLSLYRNIA 
VMSFAFICHH 

LFENYCRNDD 
TVATLVSLLI 
AVVMVFGFVM 



FLYPFIAMIS 
KLGKVSLIST 
NSFLVYSSLE 

LVTFGRFCYG 
DCLGIVLELN 
AITNTQDCTH 



YNIIAGDTLS 
GLTTLILGIV 
EPTVAKWSRL 
VTVILTYPME 
GVLCATPLIF 
GQEMFYCFPD 



KVFQRIPGVD 
MARAISLGPH 
IHMSIVISVF 

CFVTREVIAN 
IIPSACYLKL 
NFSLTNTSES 



Seq ID NO: 676 DNA sequence 

Nucleic Acid Accession #: NM_006853.1 

Coding sequence: 26..874 



AGGAATCTGC 
ATCGGGCAGA 
CATGAGGATT 
CAGGATCATC 
CGAGAAGACG 
AGCCCACTGC 
GGAGGGCTGT 
CAGCCTCCCC 
CTCCATCACC 
CAGCTGCCTC 
CTTGCGATGC 
CAACATCACA 
GGGTGACTCC 
CCAGGATCCG 
GGACTGGATC 
ACCCTCCATT 
CAAGACCCTC 
AATCAACCTG 
GACTCTGGGA 
TCCTGGCCAT 



11 
1 

GCTCGGGTTC 
GGTCTCACAG 
CTGCAGTTAA 
AAGGGGTTCG 
CGGCTACTCT 
CTCAAGCCCC 
GAGCAGACCC 
AACAAAGACC 
TGGGCTGTGC 
ATTTCCGGCT 
GCCAACATCA 
GACACCATGG 
GGGGGCCCTC 
TGTGCGATCA 
CAGGAGACGA 
TCCACTTGGT 
TACGT^CATT 
GGGTTCGAAA 
ATGACAACAC 
ATATCAAGGT 



21 
I 

CGCAGATGCA 
CAGCCAAGGA 
TCCTGCTTGC 
AGTGCAAGCC 
GTGGGGCGAC 
GCTACATAGT 
GGACAGCCAC 
ACCGCAATGA 
GACCCCTCAC 
GGGGCAGCAC 
CCATCATTGA 
TGTGTGCCAG 
TGGTCTGTAA 
CCCGAAAGCC 
TGAAGAACAA 
GTTTGGTTCC 
CTTTGGGCCT 
TCAGTGAGAC 
CT6GTTTGTT 
TTCAATAAAT 



31 
I 

GAGGTTGAGG 
ACCTGGGGCC 
TCTGGCAACA 
TCACTCCCAG 
GCTCATCGCC 
TCACCTGGGG 
TGAGTCCTTC 
CATCATGCTG 
CCTCTCCTCA 
GTCCAGCCCC 
GGACCAGAAG 
CGTGCAGGAA 
CCAGTCTCTT 
TGGTGTCTAC 
TTAGACTGGA 
TGTTCACTCT 
CCTGGACTAC 
CTGGATTCAA 
CTCTGTTGTA 
ATTTGCTAAA 



41 
I 

TGGCTGCGGG 
CGCTCCTCCC 
GGGCTTGTAG 
CCCTGGCAGG 
CCCAGATGGC 
CAGCACAACC 
CCCCACCCCG 
GTGAAGATGG 
CGCTGTGTCA 
CAGTTACGCC 
TGTGAGAACG 
GGGGGCAAGG 
CAAGGCATTA 
ACGAAAGTCT 
CCCACCCACC 
GTTAATAAGA 
AGGAGATGCT 
ATTCTGCCTT 
TCCCCAGCCC 
TGAGTG 



Seq ID NO: 677 Protein sequence 
Protein Accession #: NP_006844.1 

1 . 11 



41 



51 
I 

ACTGGAAGTC 
CCCTCCAGGC 
GGGGAGAGAC 
CAGCCCTGTT 
TCCTGACAGC 
TCCAGAAGGA 
GCTTCAACAA 
CATCGCCAGT 
CTGCTGGCAC 
TGCCTCACAC 
CCTACCCCGG 
ACTCCTGCCA 
TCTCCTGGGG 
GCAAATATGT 
ACAGCCCATC 
AACCCTJAGC 
GTCACTTAAT 
GAAATATTGT 
CAAAGACAGC 



51 



60 
120 
180 

240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



21 31 

I I I I I I 

MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRWLLTA 




AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 
SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 
NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 
DWIQETMKNN 

Seq ID NO: 678 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..933 



ATGTGCAGCA 
TTCGACAAGA 
TTCCCCTGTG 
GACTGTCCCG 
GCCCGCTACC 
AATAACTGTC 
GGGCAGGTGT 
ATCATCGGCA 
CACCAGCGGA 
CTGCTGTCCC 
AATAATGGCA 
CCACCCTCCT 
CCGCCCTACT 
CGGTCCGGGA 
GACACCAGCC 
TCTGAGCCGA 



11 
I 

ATGGACGGTG 
GTGATGAGAA 
CCAGCGGCAT 
ATGGCAGCGA 
ACTGCAAGAA 
AAGACAACAG 
TTGTGACTTC 
GCTCCGTCAT 
AGCGGAACAA 
GCCTGGTGGT 
TCCAGTATGT 
ACTCCGAGGC 
CTTCTGACAC 
GTGCCAACAG 
ACAGCCCGGG 
GCCAGGGCAC 



21 
1 

CATCCCGGGC 
GGAGTGCCCC 
CCATTGCATC 
TGAAGAGAAC 
CGGCCTCTGT 
TGATGAGGAA 
AGAGAACCAA 
TTTTGTGCTG 
CCTCATGACG 
CCTGGACCAC 
GGCCAGCCAG 
CTTGCTGGAC 
GGAATCTCTG 
TGCCAGCTCC 
GCAGCCTGGC 
TGAAGAAGTA 



31 
I 

GCCTGGCAGT 
AAGGCTAAGT 
ATTGGTCGCT 
TGCACAGCAA 
ATTGACAAGA 
AGCTGTGAAA 
CTTQTGTATT 
GTGGTGGCCC 
CTGCCCGTGC 
CCCCACCACT 
GCGGAGCAGA 
CA6AGGCCTG 
AACCAAGCCG 
CAGGCAGCCA 
CCCCAGGAGG 
TAA 



41 
1 

GTGACGGGCT 
CGAAATGTGG 
TCCGGTGCAA 
ACCCTCTGCT 
GCTTCATCTG 
GTTCTCAAGA 
ACCCCAGCAT 
TGCTGGCACT 
ACCGGCTGCA 
GCAACGTCAC 
ATGCGTCGGA 
CGTGGTATGA 
ACCTGCCCCC 
GCAGCCTCCT 
GCACTGCTGA 



51 

r 

GCCTGACTGC 
CCCGACCTTC 
TGGGTTTGAG 
TTGCTCCACC 
CGATGGACA6 
ACCCGGCAGT 
CACCTATGCC 
GGTCTTGCAC 
GCACCCTGTG 
CTACAACGTC 
AGTAGGCTCC 
CCTTCCTCCA 
CTACCGCTCC 
GAGCGTGGAA 
GCCCAGGGAC 



120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 679 Protein sequence 
Protein Accession #: Eos sequence 



MCSNGRCIPG 
DCPDGSDEEN 
GQVFVTSENQ 
LLSRLWLDH 
PPYSSDTESL 
SEPSQGTEEV 



11 

1 

AWQCDGLPDC 
CTANPLLCST 
IjVYYPSITYA 
PHHCNVTYNV 
NQADIiPPYRS 



21 

1 

FDKSDEKECP 
ARYHCKNGLC 
IIGSSVIFVL 
NNGIQYVASQ 
RSGSANSASS 



31 

1 

KAKSKCGPTF 
IDKSFIC33GQ 
WALIiAIiVLH 
ABQNASEVGS 
QAASSLLSVE 



41 
I 

FPCASGIHCI 
NNCQDNSDEE 
HQRKRNNIiMT 
PPSYSEALLD 
DTSHSPGQPG 



51 
I 

IGRFRCNGFE 
SCESSQEFGS 
LPVHRLQHPV 
QRPAWYDLPP 
PQEGTAEPRD 



60 
120 
180 
240 
300 



Seq ID NO: 680 DNA sequence 
Nucleic Acid Accession #: S78203.1 
Coding sequence: 1..2190 



ATGAATCCTT 
GAGGTACCAC 
AACTATCCAC 
TATGGAATGA 
ACCTCCACAT 
GCAGCCATTG 
TATGTGCTTG 
GTACACACAG 
AAACCCTGTG 
ACTAGATACT 
ATCACACCCA 
TTTGGAGTTC 
ATATACAATA 
TTTGCTATTT 
CTAGACTGGG 
AGGGTACTAT 
TCACGATGGA 
GCGGAGCAGA 
TTTGTCATTT 
GCTGTTGGTA 
ATAAATGAAA 
CTGGCAGATG 
GAGTCCATCA 
AGCCAGGATT 
GTGCAGGAGA 
ATGATGGTAA 
AACACTTT6C 
GAAGACTATG 
TGTAGAACAG 
TATCTGTTTG 
ATTCCAGCCA 
6GGGAGGTCA 
ATGAAATCTG 
CTTGTTGTGG 
CTCCTGCTGG 
ACAGAGGATA 
AAACTAGAGA 



11 
t . 

TCCAGAAAAA 
CTCGACCACC 
TGAGCATTGC 
AAGCT6TGCT 
CTATATACCA 
CTGACTCGTG 
GCCATGTGAT 
TCCTATCATT 
TGGCAGCTTT 
TCTCAGTCTT 
TGCTGAGAGG 
CAGGACTGCT 
AACCACCCCC 
CCAATCGTTT 
CAGCTGAGAA 
TCCTTTATAT 
CTTTGCAAGC 
TGCAGGTTCT 
ATCGTCTGGT 
TGATCCTAGC 
TGGCCCCAGC 
ATGAGGTGAA 
AATCCTTTCA 
TTCACTTCCA 
AGAACTGGTA 
AGGATACAGA 
ATAAAGATGT 
GTGTGTCTGC 
AAGATAAGAA 
TTATTACTAA 
ACAAAATGTC 
TGTTCTCTGT 
TGCTCCAGGC 
CACAGTTCAG 
TGATCTGCCT 
TGCGGGGTCC 
CCAAGAAGAC 



21 

1 

TGAGTGCAAG 
TAGCCCTCCA 
CTTCATTGTG 
GATCCTGTAT 
TGCCTTCAGC 
GTTGGGAAAA 
CAAGTCCTTG 
GATCGGCCTG 
TGGTGGAGAC 
CTACCTGTCC 
AGATGTGCAA 
CATGGTAATT 
TGAAGGAAAC 
CAAGAACCGT 
ATATCCAAAG 
CCCATTGCCC 
CATCAGGATG 
AAATCCCTTT 
CTCCAAGTGT 
GTGCCTGGCA 
CCAGTCAGGT 
GGTGACAGTG 
GAAAACACCA 
CCTGAAATAT 
CAGTCTTGTC 
AAGCAAAACA 
CAACATCTCC 
TTATAGAACT 
CTTTTCTCTG 
TAACACCAAT 
CATTGCGTGG 
CACAGGTCTT 
AGCTTGGCTA 
TGGCCTGGTA 
GATCTTCTCC 
AGCAGATAAG 
AAAACTCTGA 



31 

! 

GAAACTCTTT 
AAGAAGCCAT 
GTGAATGAAT 
TTCCTGTATT 
AGCCTCTGTT 
TTCAAGACAA 
GGTGCCTTAC 
AGTCTAATAG 
CAGTTTGAAG 
ATCAATGCAG 
TGTTTTGGAG 
GCACTTGTTG 
ATAGTGGCTC 
TCTGGAGACA 
CAGCTCATTA 
ATGTTCTGGG 
AATAGGAATT 
CTGGTTCTTA 
GGAATTAACT 
TTTGCAGTTG 
CCCCAGGAGG 
GTGGGAAATG 
CACTATTCCA 
CACAATTTGT 
ATTCGTGAAG 
ACCAATGGGA 
CTGAGTACAG 
GTGCAAAGAG 
AATTTGGGTC 
CAGGGTCTTC 
CAGCTACCAC 
GAGTTTTCTT 
TTGACAATTG 
CAGTGGGCCG 
ATCATGGGCT 
CACATTCCTC 



41 

1 

TTTCACCTGT 
CTCCGACAAT 
TCTGCGAGCG 
TCCTGCACTG 
ATTTTACTCC 
TCATCTATCT 
CAATACTGGG 
CTTTGGGGAC 
AAAAACATGC 
GGAGCTTGAT 
AAGACTGCTA 
TGTTTGCAAT 
T^GTTTTCAA 
TTCCARAGCG 
TGGATGTAAA 
CTCTTTTGGA 
TGGGGTTTTT 
TCTTCATCCC 
TCTCATCACT 
CGGCAGCTGT 
TTTTCCTACA 
AAAACAATTC 
AACTGCACCT 
CTCTCTACAC 
ATGGGAACAG 
TGACAACCGT 
ATACCTCTCT 
GAGAATACCC 
TTCTAGACTT 
AGGCCTGGAA 
AATATGCCCT 
ATTCTCAGGC 
CAGTTGGGAA 
AATTCATTTT 
ACTACTATGT 
ACATCCAGGG 



51 
I 

CTCCATTGAA 
CTGTGGCTCC 
CTTTTCCTAT 
GAATGAAGAT 
CATCCTGGGA 
CTCCTTGGTG 
AGGACAAGTG 
AGGAGGCATC 
AGAGGAACGG 
TTCTACATTT 
TGCATTGGCT 
GGGAAGCAAA 
ATGTATCTGG 
ACAGCACTGG 
GGCACTGACC 
TCAGCAGGGT 
TGTGCTTCAG 
GTTGTTTGAC 
TAGGAAAATG 
AGAGATAAAA 
AGTCTTGAAT 
TCTGTTGATA 
GAAAACAAAA 
TGAGCATTCT 
TATCTCCAGC 
GAGGTTTGTT 
CAATGTTGGT 
TGCAGTGCAC 
TGGTGCAGCA 
GATTGAAGAC 
GGTTACAGCT 
TCCCTCTAGC 
TATCATCGTG 
GTTTTCCTGC 
TCCTGTAAAG 
GAACATGATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



Seq ID NO: 681 Protein sequence 
Protein Accession #: AAB34388.1 
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5 
45 



MNPFQKNESK 
YGMKAVLILY 
YVLGHVIKSL 
TRYFSVFYhS 
lYNKPPPEGN 
RVLFLYIPLP 
FVIYRLVSKC 
LADDEVKVTV 
VQEKNWYSLV 
EDYGVSAYRT 
IPANKMSIAW 
LWAQFS6LV 
KLETKKTKL 



11 
I 

ETLFSPVSIE 
FLYFLHWNED 
GALPILGGQV 
INAGSLISTF 
IVAQVFKCIW 
MFWALLDQQG 
GINFSSLRKM 
VGNENNSLLI 
IREDGNSISS 
VQRGEYPAVH 
QLPQYALVTA 
QWAEFILFSC 



21 
I 

EVPPRPPSPP 
TSTSIYHAFS 
VHTVLSLIGL 
ITPMLRGDVQ 
FAISNRFKNR 
SRWTLQAIRM 
AVGMILACIiA 
ESIKSFQKTP 
MMVKDTESKT 
CRTEDKNFSL 
GEVMFSVTGL 
LLLVICIiIFS 



31 
I 

KKPSPTICGS 
SLCYFTPILG 
SLIALGTGGI 
CFGEDCYALA 
SGDIPKRQHW 
NRNLGFFVLQ 
FAVAAAVEIK 
HYSKLHIiKTK 
TNGMTTVRFV 
NLGLLDFGAA 
EFSYSQAPSS 
IMGYYYVPVK 



41 
I 

NYPLSIAFIV 
AAIADSWLGK 
KPCVAAFGGD 
FGVPGLLMVI 
LDWAAEKYPK 
PDQMQVLNPF 
INEMAPAQSG 
SQDPHFHIiKY 
NTLHKDVNIS 
YLFVITNNTN 
MKSVLQAAWL 
TEDMRGPADK 



51 

I 

VNEFCERPSY 
FKTIIYLSLV 
QFEEKHAEER 
ALWFAMGSK 
QLIMDVKALT 
LVLIFIPLFD 
PQEVFLQVLN 
HNLSLYTEHS 
LSTDTSLNVG 
QGLQAWKIED 
LTIAVGNIIV 
HIPHIQGNMI 



Seq ID NO: 682 DNA sequence 

Nucleic Acid Accession #: NM_016077.1 

Coding sequence: 128..667 



TCGCTTTGTG 
CGCGATAGAA 
ACTGTAGATG 
CTTGGCTGTT 
GATGCTCCCC 
CTTGGGAGAC 
AAAAGGGAAA 
AAGAAGAAAT 
CAAAGCTCCT 
GACTGTAAGT 
CCTAGG6ATT 
TTACTAGGTG 
GATTCTAACA 
AAACCTATTC 



11 
1 

ATTCTTGATC 
ACGTGTTCGC 
CCCTCCAAAT 
GGAGTTGCTT 
AAAAGCAAGA 
AGCGGGGAGT 
GTGGCTGCCC 
CCTGAAATGC 
GATGAAGAAA 
TTAATTCAAG 
GGGCCAGGAC 
GACTTTGATA 
ACAAAAGCTG 
CCATGTTCTA 



21 

I 

CGGAACTTTG 
TTGCCCAGAA 
CCTTGGTTAT 
GTGGCATGTG 
CGAGCAAGAC 
ACAAGATGAT 
AGTGCTCTCA 
TCA7VACAATG 
CCCTGATTGC 
ATGCTGGACG 
CAGCAGACCT 
TGACAACAAC 
AATTTCTTCA 
AAAAAA 



31 
1 

TCACCCAGGA 
GAAGGGAAGG 
GGAATATTTG 
CCTGGGCTGG 
ACACACAGAT 
TCTTGTGGTT 
TGCTGCTGTT 
GGAATACTGT 
ATTATTGGCC 
TACTCAGATT 
AATTGACAAA 
CCCTCCATCA 
CCCAACTTAA 



41 

1 

ACCCCGGAAG 
CGCGAGTGAG 
GCTCATCCCA 
AGCCTTCGAG 
ACTGAAAGTG 
CGAAATGACT 
TCAGCCTACA 
GGCCAGCCCA 
CAT6CAAAAA 
GCACCAGGCT 
GTCACTGGTC 
CAAGTGTTTG 
ATGTTCTTGA 



51 

1 

AGGTAGCTCA 
GAAAGGAGGT 
GTACACTCGG 
TATGCTTTGG 
7UVGCAAGCAT 
TAAAGATGGG 
AGCAGATTCA 
AGGTGGTGGT 
TGCTGGGACT 
CTCAAACTGT 
ACCTAAAACT 
AAGCCTGTCA 
GATGAAAATA 



Seq ID NO: 683 Protein sequence 
Protein Accession #: NP_057161.1 



11 



21 



31 



41 51 

I I I 1 I i 

MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASILG 
DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVVVKA 
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 






Seq ID NO: 684 DNA sequence 

Nucleic Acid Accession #: NM_004864.1 

Coding sequence: 26..952 



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
ATCTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 



11 
I 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 

I 

CAGCCATGCC 
TGGTGCTCTC 
GTTTCCCGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 

1 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 



41 

1 

CTCAGGACGG 
CATGGGGGCG 
TTGCACTCCG 
AGGCTGCGGG 
GTCCGGATAC 
TCTCGGGCCG 
CGGCTGTCCC 
AGCCTTGCAA 
TCGGACCAAC 
CCGCAAGCCG 
GGGCCCGGGC 
GCCGATTGGG 
AGCCAGTTCC 
CCCGACACGG 
CAAAAGACCG 
CACTGCATAT 
GTTGTCCTGC 
GCTGTATTTA 
TCGGGGGCTG 
TGTCTGAACT 



51 

I 

TGAATGGCTC 
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAAGC 
TGCTGGCAGA 
CCAGGGGGCG 
GTTGCTGCCG 
TGCTGTCGCC 
GGGCGGCAAA 
AGCCAGCGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



Seq ID NO: 685 Protein sequence 
Protein Accession #: NP_004855.1 



1 11 

I I 
MPGQELRTVN GSQMLLVLLV 

EDLLTRLRAN QSWEDSNTDL 
HRALFRLSPT ASRSWDVTRP 
ELHLRPQAAR GRRRARARNG 
IGACPSQFRA ANMHAQIKTS 
LAKDCHCI 



21 
I 

LSWLPHGGAL 
VPAPAVRILT 
LRRQLSLARP 
DDCPLGPGRC 
LHRLKPDTEP 



31 

I 

SLAEASRASF 

PEVRLGSGGH 
QAPALHLRLS 
CRLHTVRASL 
APCCVPASYN 



41 

I 

PGPSELHSED 
LHLRISRAAL 
PPPSQSDQLL 
EDLGWADWVL 
PMVLIQKTDT 



51 
I 

SRFRELRKRY 
PEGLPEASRL 
AESSSARPQL 
SPREVQVTMC 
GVSLQTYDDL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 

120 
180 
240 
300 



Seq ID NO: 686 DNA sequence 
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Nucleic Acid Accession #: NM_002423.2 
Coding sequence: 48..851 

1 11 21 31 41 51 

] I I I I I 

ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 
TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 
GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 
ATGACTCAGA AACAAAT^T GCCAACAGTT TAGAAGCCAA ACTCAAGGAG ATGCAAAAAT 
TGTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 
CCAGATGTGG AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 
CTTCCAAAGT GGTCACCTAC AGGATCGTAT CATATACTCG AGACTTACCG CATATTACAG 
TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 
GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 
ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 
GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 
GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 
CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 
AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA AGTAATTCAA 
GAAAGAAATA GAAAGTTCAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 
TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 
CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 
ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 

Seq ID NO: 687 Protein sequence 
Protein Accession #: NP_002414.1 

1 11 21 31 41 51 

| | | | | | 

MRLTVLCAVC LLPGSLALPL PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 
EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKVVT YRIVSYTRDL 
PHITVDRLVS KALNMWGKEI PLHFRKVVWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 
APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 
PQNFKLSQDD IKGIQKLYGK RSNSRKK 

Seq ID NO: 688 DNA sequence 
Nucleic Acid Accession #: NM_005221.3 
Coding sequence: 1..870 

1 11 21 31 41 51 

| | | | | | 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 
TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 
TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 
CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 
AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 
TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 
GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 
AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 
TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 
GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 
ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 
CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTGGCTCA GCCACCACCC TCATGCCCAC 
CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 
ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 

Seq ID NO: 689 Protein sequence 
Protein Accession #: NP_005212.1 

1 11 21 31 41 51 

| | | | | | 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 
PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 
EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 
VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 

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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1. A method of detecting a lung cancer-associated transcript in a cell 
from a patient, the method comprising contacting a biological sample from the patient with a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1A-16. 

2. The method of claim 1, wherein the polynucleotide selectively 
hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1A-16. 

3. The method of claim 1, wherein the biological sample is a tissue 
sample. 

4. The method of claim 1, wherein the biological sample comprises 
isolated nucleic acids. 

5. The method of claim 4, wherein the nucleic acids are mRNA. 

6. The method of claim 4, further comprising the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

7. The method of claim 1, wherein the polynucleotide comprises a 
sequence as shown in Tables 1A-16. 

8. The method of claim 1, wherein the polynucleotide is labeled. 

9. The method of claim 8, wherein the label is a fluorescent label. 

10. The method of claim 1, wherein the polynucleotide is immobilized on 
a solid surface. 

11. The method of claim 1, wherein the patient is undergoing a therapeutic 
regimen to treat lung cancer. 

12. The method of claim 1, wherein the patient is suspected of having lung 
cancer. 

13. A method of monitoring the efficacy of a therapeutic treatment of lung 
cancer, the method comprising the steps of: 
(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 
(ii) determining the level of a lung cancer-associated transcript in the 
biological sample by contacting the biological sample with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16, 
thereby monitoring the efficacy of the therapy. 

14. The method of claim 13, further comprising the step of: (iii) comparing 
the level of the lung cancer-associated transcript to a level of the lung cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

15. The method of claim 13, wherein the patient is a human. 

16. A method of monitoring the efficacy of a therapeutic treatment of lung 
cancer, the method comprising the steps of: 
(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 
(ii) determining the level of a lung cancer-associated antibody in the biological 
sample by contacting the biological sample with a polypeptide encoded by a polynucleotide 
that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables 1A-16, wherein the polypeptide specifically binds to the lung cancer-associated 
antibody, thereby monitoring the efficacy of the therapy. 

17. The method of claim 16, further comprising the step of: (iii) comparing 
the level of the lung cancer-associated antibody to a level of the lung cancer-associated 
antibody in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

18. The method of claim 16, wherein the patient is a human. 

19. A method of monitoring the efficacy of a therapeutic treatment of lung 
cancer, the method comprising the steps of: 
(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 
(ii) determining the level of a lung cancer-associated polypeptide in the 
biological sample by contacting the biological sample with an antibody, wherein the antibody 
specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to 
a sequence at least 80% identical to a sequence as shown in Tables 1A-16, thereby 
monitoring the efficacy of the therapy. 

20. The method of claim 19, further comprising the step of: (iii) comparing 
the level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

21. The method of claim 19, wherein the patient is a human. 

22. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1A-16. 

23. The nucleic acid molecule of claim 22, which is labeled. 

24. The nucleic acid of claim 23, wherein the label is a fluorescent label. 

25. An expression vector comprising the nucleic acid of claim 22. 

26. A host cell comprising the expression vector of claim 25. 

27. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1A-16. 

28. An antibody that specifically binds a polypeptide of claim 27. 

29. The antibody of claim 28, further conjugated to an effector component. 

30. The antibody of claim 29, wherein the effector component is a 
fluorescent label. 

31. The antibody of claim 29, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

32. The antibody of claim 29, which is an antibody fragment. 




33. The antibody of claim 29, which is a humanized antibody. 

34. A method of detecting a lung cancer cell in a biological sample from a 
patient, the method comprising contacting the biological sample with an antibody of claim 
28. 

35. The method of claim 34, wherein the antibody is further conjugated to 
an effector component. 

36. The method of claim 35, wherein the effector component is a 
fluorescent label. 

37. A method of detecting antibodies specific to lung cancer in a patient, 
the method comprising contacting a biological sample from the patient with a polypeptide 
encoded by a nucleic acid comprises a sequence from Tables 1A-16. 

38. A method for identifying a compound that modulates a lung cancer-associated 
polypeptide, the method comprising the steps of: 
(i) contacting the compound with a lung cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1A-16; and 
(ii) determining the functional effect of the compound upon the polypeptide. 

39. The method of claim 38, wherein the functional effect is a physical 
effect. 

40. The method of claim 38, wherein the functional effect is a chemical 
effect. 

41. The method of claim 38, wherein the polypeptide is expressed in a 
eukaryotic host cell or cell membrane. 

42. The method of claim 38, wherein the functional effect is determined by 
measuring ligand binding to the polypeptide. 

43. The method of claim 38, wherein the polypeptide is recombinant. 
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1 44. A method of inhibiting proliferation of a lung cancer-associated cell to 

treat lung cancer in a patient, the method comprising the step of administering to the subject a 
therapeutically effective amount of a compound identified using the method of claim 38. 

45. The method of claim 44, wherein the compound is an antibody. 

46. The method of claim 45, wherein the patient is a human. 

47. A drug screening assay comprising the steps of: 
(i) administering a test compound to a mammal having lung cancer or a cell 
isolated therefrom; 
(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16 in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a control 
cell or mammal, wherein a test compound that modulates the level of expression of the 
polynucleotide is a candidate for the treatment of lung cancer. 

48. The assay of claim 47, wherein the control is a mammal with lung 
cancer or a cell therefrom that has not been treated with the test compound. 

49. The assay of claim 47, wherein the control is a normal cell or mammal. 

50. A method for treating a mammal having lung cancer comprising 
administering a compound identified by the assay of claim 47. 

51. A pharmaceutical composition for treating a mammal having lung 
cancer, the composition comprising a compound identified by the assay of claim 47 and a 
physiologically acceptable excipient. 
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